JOURNAL OF CONSULTING PSYCHOLOGY

AMERICAN PSYCHOLOGICAL ASSOCIATION


Edited by Laurance F. Shaffer 
Teachers College, Columbia University 






































Associate Editors: Edgar A. Doll, Devereux Schools, Devon, Pa. William A. Hunt, Northwestern Univer-
sity. E. Lowell Kelly, University of Michigan. Morris Krugman, Board of Education, New York, N. Y.
Bertha M. Luckey, Board of Education, Cleveland, Ohio. Ann Magaret, University of Wisconsin. Fred
McKinney, University of Missouri. Anne Roe, La Jara, N. M. Carl R. Rogers, University of Chicago.
R. Nevitt Sanford, University of California.


Lorraine Bouthilet, Managing Editor


The Journal of Consulting Psychology is the clinical journal of the American Psycho- 
logical Association. It is devoted primarily to the publication of original research relevant to 
psychological diagnosis, psychotherapy and counseling, personality, and the dynamics of be- 
havior. Case studies, theoretical contributions, descriptions of clinical techniques, and discus- 
sions of the professional problem of clinical psychology also appear in its pages. 


Editorial Office 


Manuscripts submitted for publication, books and tests for review, and correspondence on 
editorial matters should be sent to the Editor, Laurance F. Shaffer, 525 West 120th St., New
York 27, N. Y. 


Information for contributors: The important characteristics of a good article are the
significance of its subject matter, the demonstrated soundness of its conclusions, its clarity,
and its brevity. Clarity is improved by careful outlining, discriminating use of headings, and
a simple, straightforward style of writing. For brevity, authors should strive to convey every
thought without waste of words, and should avoid repetitions of methods, data or findings.

Some of the mechanical requirements for manuscripts are: Type manuscripts with wide margins,
on white bond paper. Double space between each and every line, including references, footnotes,
matter to be set in smaller type, and tables. Use only two types of headings: centered headings,
and underlined paragraph headings. Type each table, however small, on a separate sheet, with
Arabic number and title. Draw all figures in India ink in final form for reproduction, with
draftsman's lettering, but type captions for figures on separate sheets.

References must follow the exact form used in APA journals. For further information on the
details of preparing manuscripts, see:

Anderson, J. E., & Valentine, W. L. The preparation of articles for publication in the journals
of the American Psychological Association. Psychol. Bull., 1944, 14, 345-376.

There is a lag of about nine to twelve months between the acceptance of an article and its
publication. Authors may secure immediate publication, however, by arranging for articles to be
printed as extra pages, increasing the number of pages that subscribers receive. The cost to the
author of such early publication is approximately $14.00 per printed page.

Authors may order reprints when returning proof. No reprints are furnished gratis for articles
submitted after January 1, 1952. Authors are charged one-half of the extra cost of tabular
material and cuts, and the full cost of changes made on the proof.

Business Office 


Communications concerning subscriptions, change of address, claims for the nonreceipt of
a number, advertising, and other business matters, should be sent to the American Psycholog-
ical Association, Inc., 1515 Massachusetts Ave., N. W., Washington 5, D. C.


The Journal of Consulting Psychology is published bi-monthly in February, April, June, 
August, October, and December, the yearly volume comprising approximately 480 pages. The 
subscription per year is $5.00, foreign $5.50, single number $1.00. 


Copyright, 1952 by the American Psychological Association, Inc. Printed and mailed by the Dentan
Printing Co., 23 West Colorado Ave., Colorado Springs, Colo. and entered as second-class matter at the
Post Office at Colorado Springs, Colo. Acceptance for mailing at special rate of postage provided for in
section 538, Act of February 25, 1925, authorized May 27, 1940.








June, 1952 


CONTENTS

Frustration Tolerance in Relation to Diagnosis and Therapy: A. R. Hybl and Ross Stagner
The Semantic Validity of TAT Interpretations: Beverly Fest Davenport
The Goodenough Draw-A-Man Test and Primary Mental Abilities: H. L. Ansbacher
The CVS Abbreviated Individual Intelligence Scale: William A. Hunt and Elizabeth G. French
Psychological Services and Professional Problems in the Field of Mental Deficiency: Bernard L. Bloom
Patients Diagnosed Manic Depressive Psychosis, Manic State: J. R. Wittenborn and Walter Weiss
Some Psychological Aspects of Dermatosis: John C. Narciso, Jr.
Measurable Changes in Empathy with Age: Rosalind F. Dymond, Anne S. Hughes, and Virginia L. Raabe
The Identification and Measurement of Predispositional Factors in Crime and Delinquency: Harrison G. Gough and Donald R. Peterson
Test versus Academic Performance in Malfunctioning Students: Barbara A. Kirk
Changes in Stanford-Binet IQ of Retarded Six-Year-Olds: Mary Isabel Elwood
Psychoneurosis and Psychosomatic Reactions: A Rorschach Contrast: Leo Shatin
Rorschach "Shock" for Two Special Populations: Harold F. Uehling
A Note on the WISC and Other Tests of Children Six to Eight Years Old: Bertram D. Cohen and Mary J. Collier
A Comparison of the WISC and the Wechsler-Bellevue: Lois Delattre and David Cole
A Note on the Validity of the Wechsler Intelligence Scale for Children: Grace Thompson Altus












Frustration Tolerance in Relation to Diagnosis
and Therapy¹

A. R. Hybl
Veterans Administration, Knoxville, Iowa

and Ross Stagner
University of Illinois


The interpretation of personality break- 
down as inability to tolerate the burden of 
frustration and conflict encountered by the 
patient has been common for years. It is im- 
plicit in Freud’s treatment of neurosis, and 
has been made explicit in many recent dis- 
cussions of the problem. Considering this 
situation, it is amazing that so few studies of 
an experimental nature have been made on the 
relationship between frustration tolerance and 
clinical diagnosis or therapy. 

Rosenzweig [6] defined frustration toler- 
ance as “the capacity of the individual to 
withstand a given frustrating situation with- 
out distorting the so-called ‘objective’ facts of 
the life situation.” However, the operational 
criterion which he has employed, and which has 
been utilized in most of the research in this 
area, is that of decreased efficiency in dealing 
with an objective situation. 

Brown [1] and Sherman and Jost [8] 
have published observations on the responses 
of clinically diagnosed individuals to experi- 
mental frustration. Brown used an odd prob- 
lem, although he asserts that his patients 
became much involved in attempting to solve 
it. He asked the subject to strike a note on a 
toy piano without moving his feet past a cer- 
tain line. There were two methods by which 
this could be done; after the patient had dis-
covered both of these, Brown insisted that a
third way be found. Many patients apparently
became quite disturbed as a result of their
efforts to solve this problem. The experimen-
ter rated the overt behavior disturbances, and
verbal reports were obtained afterwards.
While no quantitative behavior indices were
obtained, Brown felt that his obsessive-com-
pulsive neurotics showed most upset (least tol-
erance), while the schizophrenics were hardly
disturbed.

¹Reviewed in the Veterans Administration and
published with the approval of the Chief Medical
Director. The statements and conclusions published
by the authors are the result of their own study
and do not necessarily reflect the opinion or policy
of the Veterans Administration. From the Veterans
Administration Hospital, Knoxville, Iowa.

Sherman and Jost [8] presented number 
series, first in simple combinations with easy 
time limits to establish a feeling of success, and 
then in difficult patterns under time pressure, 
inducing a feeling of failure. The neurotics
again proved most disturbed by failure, the 
schizophrenics least. 


The problems which seemed to us to merit 
further careful study were these: (a) Is 
frustration tolerance a generalized feature of 
the individual personality? Does he show 
equal loss of efficiency when frustrated on dif- 
ferent tasks? (b) How does the behavior loss
relate to clinical diagnostic categories? (c)
Does psychotherapy significantly affect frus- 
tration tolerance as measured in this manner? 


Subjects and Procedure 


The subjects of the present investigation
were 90 male patients in the Danville Veter-
ans Administration Hospital.² Seventy of these
were neuropsychiatric cases, diagnosed by the
staff as follows: 20 alcoholics, 15 psychopaths,
10 schizophrenic reaction, and 25 neurotics.³
Twenty patients made up our control group.
They were outpatients coming to the hospital
for medical care, with no neuropsychiatric
involvement discernible. They were so chosen
as to duplicate the age and education distri-
bution of the neuropsychiatric group.

²Our thanks are due to Dr. L. E. Trent, Dr. O.
K. Timm, and Dr. L. A. Pennington for assistance
in the arrangements for the research, as well as for
advice on its execution.

For the portion of the study dealing with 
effects of therapy, twenty patients constituted 
a therapy group. These twenty were chosen 
as receiving a minimum of ten sessions of in- 
dividual psychotherapy; the actual mean 
number of sessions was 28.2 hours of therapy. 
For a control group, twenty patients were 
chosen, as far as possible matching the therapy 
group as to age, education, and diagnostic cate- 
gory. These twenty were composed of patients 
who had less than ten hours of therapy, but 
not because of a diagnosis that they would not 
profit from such treatment, nor because of 
any difficulties in treatment. 


An attempt was made to select test materi- 
als which would fit in with the normal oper- 
ation of the hospital. All patients, of course, 
receive a diagnostic study including Wechsler- 
Bellevue, Rorschach, etc. They were highly 
motivated to do well on these tests. Thus it 
was decided to use somewhat similar test ma- 
terials in our study. Preliminary trials with 
patients not included in the final study elim- 
inated several tests because the patients con- 
sidered them too difficult, considered them 
tricky, or were obviously unconcerned about 
failure. Finally we selected three tests for the 
main experiment: (a) the Ferguson Form 
Boards; (b) the digit symbol subtest of the
Wechsler-Bellevue, Forms I and II; and (c)
the Minnesota Rate of Manipulation Test,
from which two scores were derived, for one
hand and for two hands. In addition, because
we wanted to focus on reactions to frustration,
we administered the Rosenzweig Picture-Asso-
ciation Test to all subjects.

³This constitutes virtually the entire intake of
cases admitted to intensive treatment wards (with
the exception of shock treatment) during the period
of our study. These cases were judged likely to
benefit from therapy and not likely to require long-
continued hospitalization. Actually there were 82
such cases: 70 are reported here; the other 12 left
the hospital before completing the test battery, or
would not cooperate in the experiment.

The procedure was as follows: each task 
was presented in three phases. In Phase I the 
subject was allowed to do well, and was en- 
couraged. In Phase II speed was emphasized 
and an electric timer with a loud clicking 
noise was set going; the experimenter express- 
ed mild disappointment and the opinion that 
the subject could do better if he tried harder. 
In Phase III standard conditions were re- 
stored, but no statements, either encouraging 
or discouraging, were made. 

In Phase I of the form boards, Ferguson 
Boards one and two were presented. In Phase 
II, Boards five and six were used with pieces 
interchanged in such a way as to prevent suc- 
cess on either. In Phase III, Boards three and 
four were given in standard procedure. 

The Digit Symbol Test was presented as 
follows: In Phase I, the subtest Form I was 
given in standard form. In Phase II, the same 
test was repeated, with warnings after 15, 35, 
and 55 seconds as to where he should be, indi- 
cating limits which were quite unattainable. In 
Phase III, the subtest from Form II was given 
with a 1½ minute time limit to agree with
Form I. 

The Minnesota Rate of Manipulation Test 
was given as follows: In Phase I, the “one- 
hand turning and placing” and “two-hand 
turning and placing” tasks were done under 
standard conditions. In Phase II, only the 
turning task was used, but the subject was 
stopped every ten seconds and told where he 
should be, again indicating an unattainable 
accomplishment. Phase III was a repetition of 
Phase I under standard instructions.

The tests were administered to patients 
after the psychological study at the time of 
admission, but before therapy began. Two 
sessions were required. At Session I, the 
Rosenzweig test was administered, followed 
by the Digit Symbol Test after a ten-minute 
rest period. At Session II, usually the next 
day, the Form Boards came first, followed by 
the Minnesota test after a ten-minute rest. 

For the therapy group, the tests were re- 
peated at the close of therapy. The only devi- 
ation from the procedure outlined was that 
different words were used in encouraging 
(Phase I) and discouraging (Phase II) the
subject, to avoid arousing suspicion. The ther-
apy control group was retested after time in-
tervals chosen to match the distribution of
times in the therapy group.⁴


Results 


Generalized character of frustration tol- 
erance. Within each of the diagnostic cate- 
gories, subjects were ranked in accordance 
with the amount of difference between Phase I 
and Phase III. This difference ranked them 
according to amount of disruption induced 
by the frustration. Rank-difference correla- 
tions among the four tasks were then com- 
puted, as shown in Table 1. 
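
The ranking step described above can be illustrated with a short sketch. It assumes the classical rank-difference formula, rho = 1 - 6(sum of d squared) / (n(n squared - 1)), which holds when there are no tied ranks; the article does not print its formula. The disruption scores and function names below are hypothetical and are not data from the study.

```python
def ranks(values):
    """Rank values from 1 (largest) to n. Assumes there are no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def rank_difference_correlation(x, y):
    """Rank-difference correlation: 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    n = len(x)
    rank_x, rank_y = ranks(x), ranks(y)
    sum_d_squared = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * sum_d_squared / (n * (n * n - 1))


# Hypothetical disruption scores (loss of efficiency after frustration)
# for five subjects on two tasks. Invented for illustration only.
form_board_loss = [30, 4, 35, 10, 31]
digit_symbol_loss = [5, -1, 6, 4, 2]

rho = rank_difference_correlation(form_board_loss, digit_symbol_loss)
print(round(rho, 2))
```

A coefficient near 1 would mean that subjects disrupted most on one task were also disrupted most on the other, which is the sense of "generalized" used here.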


Table 1

Pearson Correlations Based on Rank-Difference Correlations:
Frustration Effects within Diagnostic Groups

                                              Diagnostic group
Tests correlated                       Psycho-  Alco-            Neu-    Nor-
                                       path     hol.    Schiz.   rot.    mals

Form board and digit symbol             .33     .21     .80      .35     .78
Form board and one-hand turn
  and place                             .10     .52     .61      .68     .76
Form board and two-hand turn
  and place                             .05     .56     .65      .40     .88
Digit Symbol and one-hand turn
  and place                             .28     .58     .63      .78     .79
Digit Symbol and two-hand turn
  and place                             ...     ...     ...      ...     ...
One-hand and two-hand turn
  and place                             .93     .64     .68      .60     .71


It is clear from Table 1 that, for the 
most part, frustration tolerance appears in a 
relatively similar degree on all four tasks. 
The correlations within the normal group are 
high enough to be acceptable as reliability 
coefficients for objective tests, when we consid- 
er the very short testing time involved. The 
neurotics and schizophrenics also give good 
correlations, the alcoholics somewhat less, and
the psychopaths are decidedly inconsistent in
their performance. The test for significance
recommended by McNemar [5, p. 124] was
applied, but no significant differences were ob-
tained. When normals were compared with al-
coholics, neurotics, and psychopaths, t ratios
approaching the .05 level were gotten, but the
results must be considered inconclusive.

⁴All testing was done individually by A.R.H.
Identical remarks were made to every patient dur-
ing a given phase of the experiment.

Differences between diagnostic categories.
On performance before frustration, three
differences meet the .05 level of significance:
normals and schizophrenics tend to score be-
low neurotics, alcoholics, and psychopaths.
This may have been a motivational phenom-
enon. On improvement from Trial 1 to Trial 2
on form boards and Minnesota test, the nor-
mals did best and the schizophrenics worst
(.05 level). Detailed tables of these and other
results will be found in Hybl [4].


Table 2

Means for Diagnostic Groups of the Differences
Between Trials Immediately Preceding and
Immediately Following Frustration

                                   Means
                  Form     Digit     One-hand turn   Two-hand turn
Group             Board    Symbol    and place       and place

Psychopaths       41.8      5.0       6.8             0.1
Alcoholics        45.9      4.1       6.9             2.8
Schizophrenics    10.5     -0.9      -0.6            -2.9
Neurotics         47.8      5.8       9.1             1.4
Normals            9.6     -3.2      -0.8            -1.3





Primary concern focuses on the differences 
between Phase I and Phase III. Table 2 
shows the mean difference in score between 
the trial just preceding frustration and that 
just following frustration, for the five groups. 
Table 3 shows the corresponding t ratios for
these groups. It will be noted that 17 of these
t values exceed that required at the .001 level
of confidence, 4 fall between the .001 and
the .01 level, and 1 falls between .01 and
.05, the remaining 18 being outside the .05
level. We seem safe in rejecting the null hy-
pothesis with respect to the normals and
schizophrenics versus the neurotics, alcoholics,
and psychopaths, but unsafe in rejecting it for
comparisons between normals and schizo-
phrenics, or between alcoholics, neurotics, and
psychopaths.
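
The tally reported above can be checked directly against Table 3. The sketch below is only an illustration: it transcribes the forty t ratios from that table and sorts them by the critical values given in the table footnote (3.43, 2.63, and 1.99). One entry, schizophrenics versus neurotics on the form board, is printed illegibly in the source and is assumed here to be 4.78; the variable and function names are hypothetical.

```python
# t ratios transcribed from Table 3. Column order: form board, digit symbol,
# one-hand turn and place, two-hand turn and place.
T_RATIOS = {
    "Psychopaths and alcoholics": [0.44, 0.66, 0.05, 1.68],
    "Psychopaths and schizophrenics": [3.21, 3.87, 3.25, 1.78],
    "Psychopaths and neurotics": [0.67, 0.60, 1.09, 0.84],
    "Psychopaths and normals": [3.48, 5.62, 3.50, 0.84],
    "Alcoholics and schizophrenics": [4.34, 3.89, 3.90, 4.01],
    "Alcoholics and neurotics": [0.25, 1.60, 1.29, 1.43],
    "Alcoholics and normals": [4.80, 6.95, 4.38, 3.08],
    # First value assumed to be 4.78 (illegible in the source).
    "Schizophrenics and neurotics": [4.78, 5.55, 5.27, 3.21],
    "Schizophrenics and normals": [0.12, 1.74, 0.10, 1.16],
    "Neurotics and normals": [5.35, 8.03, 5.82, 2.14],
}

# Critical t values from the footnote to Table 3.
T_001, T_01, T_05 = 3.43, 2.63, 1.99


def tally(t_table):
    """Count t ratios falling in each significance band."""
    counts = {"beyond .001": 0, ".001 to .01": 0, ".01 to .05": 0, "outside .05": 0}
    for row in t_table.values():
        for t in row:
            if t > T_001:
                counts["beyond .001"] += 1
            elif t > T_01:
                counts[".001 to .01"] += 1
            elif t > T_05:
                counts[".01 to .05"] += 1
            else:
                counts["outside .05"] += 1
    return counts


counts = tally(T_RATIOS)
print(counts)
```

Under that one assumption the counts come to 17, 4, 1, and 18, in agreement with the figures in the text.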








Table 3

Values of t for Diagnostic Group Differences: the Difference Between Trials
Immediately Preceding and Immediately Following Frustration

                                                    t ratios*
                                    Form     Digit     One-hand turn   Two-hand turn
Between                             Board    Symbol    and place       and place

Psychopaths and alcoholics           0.44     0.66      0.05            1.68
Psychopaths and schizophrenics       3.21     3.87      3.25            1.78
Psychopaths and neurotics            0.67     0.60      1.09            0.84
Psychopaths and normals              3.48     5.62      3.50            0.84
Alcoholics and schizophrenics        4.34     3.89      3.90            4.01
Alcoholics and neurotics             0.25     1.60      1.29            1.43
Alcoholics and normals               4.80     6.95      4.38            3.08
Schizophrenics and neurotics         4.78     5.55      5.27            3.21
Schizophrenics and normals           0.12     1.74      0.10            1.16
Neurotics and normals                5.35     8.03      5.82            2.14

F ratio*                            12.3     18.07     13.4             5.17

*t of 3.43 = 0.1% level = F of 5.10
      2.63 = 1% level   = F of 3.55
      1.99 = 5% level   = F of 2.48


The same conclusions are justified if we 
compare, for form boards and rate of mani- 
pulation, the performance on the trial just be- 
fore frustration with the sum of the two 
trials just after frustration. The distribution 
of F and t values is similar, the absolute
values being slightly reduced. It would like- 
wise be justified if we use the two trials pre- 
ceding and the two trials following frustra- 
tion, but there would be a perceptible re- 
duction in the confidence level if this were 
done. The somewhat poor start of the normal 
group on Trial 1 confuses this particular com- 
parison. 

Is fatigue a factor? The hypothesis might 
be entertained that the fatigability of the 
three “neurotic” groups was greater than 
that of normals and schizophrenics. If that 
were true, the differences should be greater 
and more significant as we include these 
additional pre- and post-frustration trials.
Since the differences actually become less 
significant, it seems more plausible that the 
low-tolerance groups are gradually recover- 
ing from the effects of the frustration when 
they reach the second post-frustration trial. 


Relation to the Rosenzweig frustration
reactions. To simplify the task of comparing
Rosenzweig categories, 23 persons were chos-
en who had the highest frustration tolerance
(least interference from frustration) and 22
who had the lowest tolerance. The 23 high-
tolerance cases included 9 schizophrenics and
14 normals. The 22 low-tolerance cases
included 2 psychopaths, 7 alcoholics, and 13
neurotics. These figures are in harmony with
the mean differences in Table 2.


How do high and low tolerance groups re- 
spond to the Rosenzweig test? Table 4 gives 
the mean scores on the nine combinations of 
scoring categories and then combined scores 
[cf. 7] and the corresponding t values. It will
be noted that only one of these reaches the 
.001 level, 5 fall between .001 and .01, 2 fall 
between .01 and .05, and the remaining 7 
fall outside the .05 limit. We feel justified 
in rejecting the null hypothesis as regards 
need-persistent-impunitive reactions (the high- 
tolerance group giving more), and perhaps as 
regards the general impunitive category (high- 
tolerance giving more) and intropunitive 
(low tolerance group giving more). 

Table 4

Scores of High and Low Frustration Tolerance
Groups on the Rosenzweig Picture-
Frustration Test

                                         Mean Score
Response category                      High     Low      Diff.    t ratio*

Obstacle-dominant, extrapunitive        1.78     1.45     0.33     0.73
Obstacle-dominant, intropunitive        0.91     0.75     0.16     0.67
Obstacle-dominant, impunitive           1.17     2.34    -1.17     3.16
Ego-defensive, extrapunitive            4.65     5.50    -0.85     1.47
Ego-defensive, intropunitive            2.63     3.59    -0.96     2.40
Ego-defensive, impunitive               5.36     3.77     1.59     2.79
Need-persistent, extrapunitive          3.67     2.09     1.58     2.98
Need-persistent, intropunitive          2.33     3.18    -0.85     1.66
Need-persistent, impunitive             2.54     1.05     1.49     4.38
Extrapunitive                          10.3      9.5      0.8      1.60
Intropunitive                           5.8      7.5     -1.7      2.92
Impunitive                              9.1      7.2      1.9      2.83
Obstacle dominance                      3.9      4.5     -0.6      0.92
Ego defense                            12.2     12.8     -0.6      0.78
Need persistence                        8.6      6.3      2.3      2.46

*t of 3.58 = 0.1% level
      2.69 = 1% level
      2.01 = 5% level

Changes following therapy. While the ther-
apy and therapy-control groups were ade-
quately matched for initial performance (no
values even approach the .05 level), there was
a slight error in matching for frustration tol-
erance. On the form board, a t ratio of 2.65
(between .05 and .01) was obtained. After
therapy, there were no significant differences
in frustration tolerance between the two
groups. No values even approach the .05
level.

What about changes in tolerance as meas- 
ured by determining the difference in individ- 
ual tolerance scores before and after therapy? 
Table 5 shows these differences and the cor-
responding t values. It would appear that
there is a trend toward significance. One t
value exceeds the .01 level and one exceeds
the .05 figure. It must be recognized, how-
ever, that this table is based upon differences
between differences, and thus may have many
chance factors influencing it. Since chance fac-
tors may raise as well as lower a difference,
we are not highly confident that therapy has
produced a significant change in frustration
tolerance.
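
The "differences between differences" logic can be sketched as follows. This is a minimal illustration under stated assumptions: each subject is taken to have a frustration loss score before and after the therapy interval, the change is the earlier loss minus the later loss, and the two groups are compared with a pooled-variance t ratio. The article does not state which t formula was used, and all scores and names below are hypothetical.

```python
from math import sqrt


def tolerance_change(loss_before, loss_after):
    """Per-subject change: loss before the interval minus loss after it.
    A positive value means less disruption (higher tolerance) at retest."""
    return [before - after for before, after in zip(loss_before, loss_after)]


def pooled_t(x, y):
    """Two-sample t ratio with pooled variance (assumed, not from the article)."""
    nx, ny = len(x), len(y)
    mean_x, mean_y = sum(x) / nx, sum(y) / ny
    ss_x = sum((v - mean_x) ** 2 for v in x)
    ss_y = sum((v - mean_y) ** 2 for v in y)
    pooled_variance = (ss_x + ss_y) / (nx + ny - 2)
    return (mean_x - mean_y) / sqrt(pooled_variance * (1 / nx + 1 / ny))


# Hypothetical frustration loss scores, invented for illustration only.
therapy_before = [40, 52, 35, 47]
therapy_after = [20, 30, 25, 22]
control_before = [42, 50, 38, 45]
control_after = [40, 44, 36, 41]

therapy_change = tolerance_change(therapy_before, therapy_after)
control_change = tolerance_change(control_before, control_after)
t_ratio = pooled_t(therapy_change, control_change)
print(round(t_ratio, 2))
```

Because each change score is itself a difference of two differences between trials, its error variance accumulates from four measurements, which is the reason for the caution expressed above.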


Table 5

Differences Between Experimental and Control
Groups on the Change in Differences in
Tolerance After Psychotherapy

                                    Means
Test                            Exper.    Cont.     Diff.    t ratio*

Form Board                      19.0       3.6      15.4     3.30
Digit Symbol                     1.7       0.4       1.3     1.44
One-hand turn and place          5.5       1.7       3.8     2.25
Two-hand turn and place         -1.7      -0.3       1.4     0.93

*t of 3.55 = 0.1% level
      2.71 = 1% level
      2.08 = 5% level


Does the amount and success of therapy af-
fect tolerance? For the 20 experimental cases,
a rank-order correlation of number of therapy
sessions and rank in improved frustration tol-
erance amounts to .23. This is not statistically
significant. When the therapist classified each
of the 20 into four categories of improvement
(none, slight, moderate, and considerable),
there was a consistent trend toward
agreement with the test differences. The mean
rank (out of the 20 cases) is:

no improvement, 13.9
slight, 11.5
moderate, 10.1
considerable, 4.4.


While these figures are difficult to evaluate 
statistically, they suggest that there may per- 
haps be a significant effect of therapy upon 
resistance to experimental frustration. 


Discussion 


Before examining our results in detail, let 
us consider the kind of theoretical framework 
within which they can best be understood. 
The personality can be conceived as an energy 
system in equilibrium, which mobilizes addi- 
tional energy to protect or restore equilib- 
rium [11]. When any stimulus such as the ex- 
perimental situation occurs, it will be per- 
ceived in terms of its probable significance for 
the personality. If it offers tension reduction, 








168 


it will be perceived as a positive valence and 
energy will be mobilized to contact it. If it 
threatens tension increase, it will be perceived 
as negative, and energy mobilized to avoid it. 
If it evokes no probabilities of either tension 
increase or reduction, no energy mobilization 
can be expected. 

In the experimental situation, we utilized
stimuli which preliminary experimentation in- 
dicated would be likely to evoke energy arous- 
al. Further, by giving one successful trial, we 
encouraged the subject to perceive the situa- 
tion as offering tension reduction. The sec- 
ond trial was, therefore, in general, accom- 
panied by more vigorous effort than the first. 

Now, if the organism is in motion toward 
a goal, and is blocked from achieving that 
goal, the frustration may be assumed to be 
proportional to the vigor with which the goal 
is pursued [2]. Additional energy is then mo- 
bilized in an effort to overcome the obstacle. 
This may be due to the fact that any block- 
ing of a goal-directed act will be perceived as 
a signal of deprivation; since deprivation in- 
creases tension, blocking may do the same. 
However, at the human level, this effect de- 
pends upon the perception of the barrier. If 
it is perceived as penetrable, additional energy 
is brought to bear; if the barrier is seen as 
impenetrable, the positive valence is repressed 
and energy is diverted from this act to some- 
thing else. 

G. L. Freeman [3] has presented a sound 
argument, and some experimental evidence, to 
the effect that the consequences of additional 
energy mobilization will be a function of the 
prior tension level. If the organism is in a 
relatively relaxed condition, additional energy 
will lead to a decided increase in efficiency. 
However, if the existing tension level is high, 
additional energy mobilization may increase 
the tension beyond the limits of the indi- 
vidual’s capacity, and disorganization results. 
The energy is dissipated in random muscular 
and ideational activities of all sorts, not chan- 
neled into an adaptive attack upon the frus- 
trating situation. 

Within this framework, the experimental 
results can be interpreted as evidence for dif- 
ferences in the additional energy mobilization 
which can be tolerated by our subjects. The
concept of frustration tolerance can be equat-
ed to Freeman’s concept of the limits of 
energy mobilization. The data show convinc- 
ingly that our diagnostic groups differ with 
respect to the amount of disorganization in- 
troduced by the experimentally induced fail-
ure.

Was the failure a frustration? Direct ob- 
servation of the subjects’ behavior, as well as 
their verbalizations, indicated that the frus- 
trating trial was perceived as such by most of 
them. This was particularly true in the neu- 
rotic and alcoholic groups. The schizophren- 
ics, on the other hand, gave few overt indi- 
cations of frustration, and their performance 
scores showed no disruption of efficiency on 
the subsequent trial. 

We interpret this difference as follows: 
The neurotics, alcoholics, and psychopaths 
perceived the experimental situation, espe- 
cially after the first successful trial, as offer- 
ing tension reduction. The experimenter gave 
them mild praise, and the task was moderate- 
ly attractive. Thus, on the second trial, addi- 
tional effort was put into the performance. 
Now a blockage is encountered. To most of 
this group it must have been perceived as a 
direct threat to the self, since many of them 
had long histories of failure in various activ- 
ities. The amount of tension aroused, there- 
fore, was more than proportional to the frus- 
tration; it was also a function of this negative 
valence which was introduced into the situa- 
tion. Thus the total tension level must have 
increased appreciably from Trial 2 to Trial 3. 

But this is by no means the whole story. 
The typical neurotic (using this term for our 
three diagnostic groups) may be presumed to 
operate on a higher tension level than the nor- 
mally-adjusted personality. He lives in a 
world full of threats and frustrations. His 
physiological responses to such stimuli are 
more vigorous than those of the normal per- 
son. Thus it may be assumed that, at the be- 
ginning of the experiment, he was much clos- 
er to his limit of tolerance than the controls. 
When this is combined with the presumably 
greater arousal as a result of the frustration, 
the disruption of his behavior is understand- 
able. 

What can this theory offer to explain the
results on the schizophrenics? We suggest
that the schizophrenic has achieved a stable 
equilibrium by repressing both positive and 
negative valences. He denies that it is im- 
portant for him to succeed on the task, he also 
denies that it matters if he fails. By drasti- 
cally restricting his phenomenal field, he pro- 
tects himself against any disturbance. This 
need not mean that his total tension level is 
low; he may indeed have a considerable
amount of energy mobilized to maintain this 
artificial barrier against outside stimulation. 
However, no change is produced by the frus- 
trating experience, or, more precisely, his de- 
fenses operate in such a manner that the fail- 
ure is not frustrating. There is consequently 
no interference with efficiency on Trial 3. 

The normal individual, we assume, does mo- 
bilize energy for the task at hand, and is frus- 
trated by failure. He has, however, a consid- 
erable backlog of successful experiences, and 
consequently does not magnify this particular 
frustration as does the neurotic. Furthermore, 
he has enough leeway left between his normal 
operating tension level and his ceiling that he 
can tolerate more tension without material 
disruption of behavioral control. Thus the 
normal and the schizophrenic make similar
scores, but the dynamic processes involved 
must be quite different. 


Why is the tolerance level a generalized 
phenomenon, appearing in several different 
tasks? We have assumed in the foregoing 
analysis that it is a physiological function, a 
threshold beyond which additional tension 
overflows directly into response without con- 
trol. There is considerable evidence that this 
is a justifiable assumption [9, 10]. In addi- 
tion, the importance of the self concept must 
be recognized. Every failure is a threat to the 
self. If the individual has experienced many 
failures, an additional one will presumably 
be more threatening, more traumatic—until 
he reaches the point of despair and adopts the 
schizophrenic tactic of refusing to perceive 
them. Each person in our experiment can be 
thought of as falling somewhere on a con- 
tinuum from “self perceived as adequate” to 
“self perceived as overwhelmed.” Since this 
will be a common factor in the four tasks 
used, it also plays a part in accounting for the 


169 


similar disruption in all four performances. 

The results from the Rosenzweig test are 
in harmony with this interpretation. The low-
tolerance group has a high proportion of in-
tropunitive responses, and especially of ego- 
related intropunitive items. The high-toler- 
ance cases have more extrapunitive and im- 
punitive responses. Now the intropunitive pat- 
tern is associated with a feeling of weakness 
and helplessnes; the individual is afraid to 
adopt an extrapunitive stand, and he does not 
feel secure enough to take an impunitive po- 
sition. It is probable that he has in the past 
responded with extrapunitive aggression and 
been punished for so doing. He now turns his 
aggression inward upon himself rather than 
take this chance again. The impunitive reac- 
tion denies the existence of frustration, and 
thus is the characteristic defense of the schizo- 
phrenic and of many normals; the neurotic 
experiences a frustration which is too strong 
to be treated in this fashion. 

What about the effects of therapy? Rosenzweig [6] proposed that psychotherapy be visualized as a “process of building up frustration tolerance by allowing the patients, as in the psychoanalytic situation where frustrations are constantly discussed, to experience small or tolerable doses of frustration until resistance is gradually developed.” This analogy with immunization does not seem very plausible. What does fit with recent research on the effect of therapy is that there is a change in perception (a) of threatening situations, and (b) of the self. Stork [12], for example, has shown that with progress in therapy there is a decline in the evaluation of other persons as threatening, and an increase in positive self-evaluation. 


As we noted above, the significance of our results on this problem should not be overestimated. Nevertheless, the trend in three of the four tasks is toward higher tolerance after therapy, and two of these t values are high. It is our interpretation that therapy has operated to reduce the perceived threat in the failure situation, or to enhance the person’s self-evaluation so that he is less severely disturbed by the experimental failure, or both.5 


5It must also be conceded that the differences may have been small because the therapy was not maximally effective. Many of these cases were handled by interns and in most cases the treatment was not continued beyond the point at which the patient could return to work. 


Summary 


1. Experimental frustration was induced by giving one or two successful trials on simple tasks, followed by imposed failure. The impact of the frustration was measured by the disturbance of performance on the succeeding trials. 

2. Normals and schizophrenics showed high frustration tolerance (little disruption of efficiency); neurotics, alcoholics, and psychopaths showed substantial disruption. Neurotics, alcoholics, and psychopaths did not differ significantly from each other, but were significantly different from the other two categories. 

3. Frustration tolerance, as measured by disruption, is a generalized aspect of the individual personality in so far as the four tasks are a representative sample. Psychopaths show the least degree of generalization in this regard. 

4. Therapy tends to increase frustration tolerance, although the differences are not large. 


Received October 15, 1951. 


References 


1. Brown, J. F. Reactions of psychiatric patients in a frustrating situation. Bull. Menninger Clin., 1939, 3, 44-64. 

2. Dollard, J., et al. Frustration and aggression. New Haven: Yale Univer. Press, 1939. 

3. Freeman, G. L. Energetics of human behavior. Ithaca, N. Y.: Cornell Univer. Press, 1948. 

4. Hybl, A. R. Frustration tolerance and its relation to diagnosis and therapy. Urbana, Illinois: University of Illinois Library, 1951. 

5. McNemar, Q. Psychological statistics. New York: Wiley, 1949. 

6. Rosenzweig, S. Frustration as an experimental problem. Charact. & Pers., 1938, 7, 126-128; 150-154. 

7. Rosenzweig, S., et al. Revised scoring manual for the Rosenzweig Picture Frustration Study. J. Psychol., 1947, 24, 165-208. 

8. Sherman, M., & Jost, H. Frustration reactions of normal and neurotic persons. J. Psychol., 1942, 13, 3-19. 

9. Shock, N. W. Physiological factors in behavior. In: J. McV. Hunt (Ed.), Personality and the behavior disorders. New York: Ronald Press, 1944. 

10. Stagner, R. Psychology of personality. New York: McGraw-Hill, 1948. 

11. Stagner, R. Homeostasis as a unifying concept in personality theory. Psychol. Rev., 1951, 58, 5-17. 

12. Stork, D. An investigation into the interrelations between the self-concept and feelings directed toward other persons and groups. J. consult. Psychol., 1949, 18, 176-180. 








The Semantic Validity of TAT Interpretations1, 2 


Beverly Fest Davenport 


University of Southern California 


Previous attempts to validate projective tests have been criticized for being either too atomistic or too holistic [1, 4, 5]. In the former case, elements of the raw data are counted and correlated with something, a process in which objectivity is retained, but the meaning and richness of the data are lost. The holistic methods, such as blind matching, indicate that there is something valid about the combination of projective test, interpretation, and clinician [7]. They retain the richness of the raw data. However, nothing is learned from blind matching studies about the specific strengths and weaknesses of the interpretative process. Several authors have suggested that what needs to be studied, at present, are the separate interpretative statements which make up a full test interpretation [1, 3, 4, 5, 6]. This approach will have the shortcoming that the context will be lost, but it promises to be fruitful in providing clues as to how statements in test interpretations could be improved, in order to avoid semantic confusions. An interest in studying the separate interpretative statements was the starting point for this study. 

The validity of projective test interpretations cannot be demonstrated unless there is a high degree of inter-interpreter reliability in the making of interpretations. The usual measure of inter-rater reliability involves the average per cent agreement or average intercorrelation between each possible pair of 


1The material contained in this paper is abstract- 
ed from a Ph.D. thesis submitted to the Graduate 
Faculty of the University of Southern California. 
The paper is published with the permission of the 
Chief Medical Director, Department of Medicine 
and Surgery, Veterans Administration, who assumes 
no responsibility for the opinions expressed or the 
conclusions drawn by the author. 

2This study was conducted at the various facil- 
ities of the Veterans Administration Regional Office, 
Los Angeles, while the author was employed by 
them as a trainee in clinical psychology. 




raters. These figures are ambiguous in their meaning and often do not lend themselves to tests of significance. They may be spuriously high due to a large number of test items which do not discriminate between the subjects or records being studied. When they are low, it is not known whether the test item was ambiguous, the process of interpretation invalid, or the clinicians unreliable. The discriminability of interpretative statements has never been investigated. 

Day-to-day clinical experience in trying to understand test interpretations leads to the impression that different clinicians attach different meanings to the most common concepts, and that these concepts are ambiguous. 

In line with the problems discussed thus far, the concept of reliable-discrimination was developed. A reliably-discriminating statement is one which is used in the same manner by most clinicians, and which discriminates between data to which it applies and data to which it does not apply. Those statements which are reliably applied to all data or no data are excluded because they do not discriminate. Those statements applied by the same proportion of clinicians to all data, or which might be applied by some clinicians to any data, are also excluded for the same reason. What are left are nonambiguous (operationally defined), nonuniversal statements. The major purpose of this study was to investigate reliable-discrimination. In addition, universality and ambiguity were studied. 


Procedure 


TAT records were obtained from a heterogeneous group of adult male veterans. Four were patients at the Veterans Administration Neuropsychiatric Hospital at Los Angeles. They included one patient diagnosed as psychotic (“dementia praecox, simple type, incompetent”), one character disorder (“passive-dependency reaction”), and two neurotics (“psychoneurosis with hypochondriacal features,” and “obsessive-compulsive”). The remaining two subjects, considered normal, were successfully employed at white collar jobs, had no abnormally high MMPI scores, and had never been considered in need of psychiatric treatment. They suffered from none of the disorders often considered to be of psychosomatic origin, such as ulcers, hypertension, asthma, migraine, diabetes, spastic colitis, or glandular disorders. 

The records from these subjects were inter- 
preted by six clinical psychologists in training 
with the Veterans Administration Regional 
Office, Los Angeles. These were not the same 
persons who had administered the tests, since 
direct contact would have provided them with 
additional clues. The aim in selecting the per- 
sons who were to do the interpretations was to 
obtain a representative sample of one class of 
“TAT users.” We were not interested in 
studying the performance of a single interpre- 
ter, but rather in permitting a representative 
range of individual differences in language 
usage to enter. 

The interpretations thus obtained were di- 
vided into separate statements. When a state- 
ment contained several ideas, it was separated 
into as many statements, and both the original 
complex statement and the separate parts were 
retained. Obvious duplications were removed. 
This process resulted in a sample of 207 inter- 
pretative statements. 

The statements were studied to see that al- 
ternative and complementary statements were 
present, in relation to each topic covered. 
When such statements did not exist, the 
author invented a statement. Forty-three 
statements were thus added temporarily, but 
were eventually discarded for reasons to be ex- 
plained later. 

The statements were placed on separate 
cards. Once a week for six weeks, each of six 
experienced clinical psychologists saw one of 
the six original TAT records, and decided 
which of the interpretative statements applied 
to it, and which of them did not. A tally was 
made of the total number of times each state- 
ment was applied to each protocol. (Examples 
are shown in Table 1.) These tallies served 
as the basis for the evaluation of reliable-discrimination. 


Table 1 

Number of Times a Statement Was Applied to Each Record, and Total Use 

                        Patients 
Statement     A     B     C     D     E     F     Total Use 
    1         6     0     2     3     2     4        17 
    2         0     0     0     1     1     1         3 
    3         6     6     6     5     5     5        33 
    4         3     4     5     4     3     2        21 
    5         3     3     3     3     3     3        18 
  Etc. 


It would have been desirable to rank all of 
the statements according to how well they dis- 
criminated between patients but no statistical 
methods have yet been devised for dealing with 
this type of problem. 

It was possible, however, to group the statements into seven categories based on a continuum of reliable-discrimination, and to determine whether or not the data offered evidence of significant deviation from chance.3 Reliable-discrimination was considered to be greatest when all six judges agreed that there was a difference in the applicability of a statement to at least two of the six records (Table 1, Statement 1). It was at a minimum when all of the records received the same number of applications of the statement. Seven categories of reliable-discrimination were based on the width of the range of numbers of applications to each record, for each statement. Each category was defined by a limiting pair of records. This can best be understood by reference to Table 2, which reveals the systematic basis for forming each category, and the number of statements which fell into it. 
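
The assignment of a statement to a category can be illustrated with a short sketch. It assumes, following Table 2, that the category number equals the width of the range of applications counted inclusively (a limiting pair of 0-6 gives category 7, and equal tallies give category 1). The function name `range_category` and the dictionary `TALLIES` are illustrative only; the tallies themselves are the five example rows of Table 1.

```python
# Tallies from Table 1: number of judges (0 to 6) who applied each
# statement to records A through F.
TALLIES = {
    1: [6, 0, 2, 3, 2, 4],
    2: [0, 0, 0, 1, 1, 1],
    3: [6, 6, 6, 5, 5, 5],
    4: [3, 4, 5, 4, 3, 2],
    5: [3, 3, 3, 3, 3, 3],
}


def range_category(tally):
    """Return (category, limiting pair) for one statement.

    Assumption: the category is the inclusive width of the range of
    applications, as in the Range column of Table 2.
    """
    low, high = min(tally), max(tally)
    return high - low + 1, (low, high)


for statement, tally in TALLIES.items():
    category, (low, high) = range_category(tally)
    print(f"Statement {statement}: limiting pair {low}-{high}, category {category}")
```

Under this reading, Statement 1 of Table 1 falls in the highest category (limiting pair 0-6), while Statement 5, applied equally often to every record, falls in the lowest.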


In addition to the above procedure, the 207 
statements were rated on two six-point rating 
scales by twenty-six clinical psychologists not 
previously connected with the study. The 
scales dealt with ambiguity (statements with 
a vague or double meaning) and universality 
(statements applicable to any adult male). 
Mean ratings were obtained for purposes of 
ranking. 


3The equations for this problem were developed 
by Bernard Sherman, Ph.D., professor of mathe- 
matics at the University of Southern California. 
















Table 2 

Definitions of Categories of Reliable-Discrimination 

Explanation                                     Limiting pairs                        Range   Obser. freq. 

At least one record received six judgments      0-6                                     7         12 
of “applies” and at least one record 
received no judgments of “applies” 

At least one record received five judgments     0-5; 1-6                                6         37 
of “applies” and at least one record 
received no judgments of “applies,” or vice 
versa; or at least one record received six 
judgments of “applies” and at least one 
record received one judgment of “applies,” 
or vice versa 

Et cetera                                       0-4; 1-5; 2-6                           5         39 

Et cetera                                       0-3; 1-4; 2-5; 3-6                      4         72 

Et cetera                                       0-2; 1-3; 2-4; 3-5; 4-6                 3         34 

Et cetera                                       0-1; 1-2; 2-3; 3-4; 4-5; 5-6            2         13 

Et cetera                                       0-0; 1-1; 2-2; 3-3; 4-4; 5-5; 6-6       1          0 





The total number of times each statement 
was applied by some rater was tallied, and 
statements ranked according to amount of use. 

Statements were divided into two groups on 
the basis of their tendency to be ignored by all 
the raters (not applied to any record by more 
than three raters), or to be used readily (the 
balance). This frame of reference will be call- 
ed agreed-unused. 
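
The grouping rule just described can be written as a one-line test. This is only a sketch: the function name `is_agreed_unused` and the parameter `max_raters` are illustrative, and the tallies are the example rows of Table 1 rather than the full set of 207 statements.

```python
def is_agreed_unused(tally, max_raters=3):
    """True when no record received the statement from more than
    `max_raters` raters (the agreed-unused group as defined above)."""
    return max(tally) <= max_raters


# Example rows of Table 1 (applications to records A through F).
examples = {
    1: [6, 0, 2, 3, 2, 4],
    2: [0, 0, 0, 1, 1, 1],
    5: [3, 3, 3, 3, 3, 3],
}

for statement, tally in examples.items():
    group = "agreed-unused" if is_agreed_unused(tally) else "agreed-used"
    print(f"Statement {statement}: {group}")
```

Note that by this rule a statement applied by exactly three raters to every record, such as Statement 5, still counts as agreed-unused.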


Results 


Reliable-discrimination. It will be noted in Table 2 that only 12 of the 207 statements led to sets of judgments wherein a minimum of two of the six records had been discriminated with complete agreement among the six judges. An examination of the tallies for these 12 statements revealed that for 7 of them, only the two records had been thus discriminated; for 4 statements, three records; and for 1 statement, four records. In no instance was an interpretative statement applied to as many as five or six records in such a way that the judges divided the records into two groups, to one of which a statement would have applied, and to the other of which it would have been nonapplicable. In other words, either the statements were not used at all, or they were applied indiscriminately to everyone, or by the same proportion of raters to everyone, or they failed to discriminate between most of the patients. 

In spite of the predominance of a lack of reliable-discrimination, it was possible to demonstrate (Table 3) that it exceeded chance expectancy. Chi square was 81.6. With four degrees of freedom, a chi square of 13.3 is significant at the .01 level. Table 3 also indicates that the only significant contribution to the chi square came from the upper ranges. 


Table 3 

Chi Square: Obtained versus Chance Frequencies in Five Categories of Reliable-Discrimination* 

Range     Obtained     Expected     D²/fe 
7, 6         49          15.4        73.8 
5            39          48.2         1.6 
4            72          78.5          .5 
3            34          53.4         5.5 
2            13          11.2          .2 

                         Chi square  81.6 

Chi square = 13.3 at the .01 level with 4 degrees of freedom. 

*Yates’s correction for continuity was made when obtained frequency was less than 50. 


The statements from the upper ranges were 
compared by the author with those in the 
lower ranges to see if any definable qualitative 
differences could be observed. None presented 
themselves. More specifically, it made no ap- 
parent difference whether the statements re- 
ferred to symptoms, defenses, dynamics, family 
relationships, sexuality, ego strength, domi- 
nance-submission, or feelings. 

No significant statistical relationships were 
found between reliable-discrimination, and uni- 
versality or ambiguity. 

Universality and Use. The Pearson correlation between the mean ratings of universality and the total use of each statement was .43 ± .057, indicating that judges tended to prefer the use of universal statements and avoid the use of the more specific ones. If the more specific statements had been applied by many raters to only one patient, they would have fallen low on the use scale, adding to the above correlation, but this would have had quite a different interpretation than if each rater applied a statement once to a different patient. Defining as agreed-unused those statements which had never been applied to any record more than three times, and the balance as agreed-used, the frequencies expected by chance of the 47 least universal, 49 most universal, and 111 of the middle group were obtained, and chi square computed. Table 4 shows the obtained and expected frequencies for the various categories. Chi square was 27.0. With two degrees of freedom, a chi square of 9.2 is significant at the .01 level. 


Table 4 

Obtained and Expected Frequencies for Agreed-Unused Groups and Universality 

                      Agreed-Used 
Universality     Obtained     Expected 
Most                36          34.1 
Middle              93          80.4 
Least               21          35.5 
Totals             150         150.0 
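
The chi square for Table 4 can be checked with a short sketch. It assumes that the convention stated in the footnote to Table 3 (Yates's correction applied only where the obtained frequency is less than 50) was also followed for Table 4; the text does not say so explicitly. The function name `chi_square_selective_yates` and the parameter `threshold` are illustrative. The frequencies are the six obtained and expected cell values of Table 4 (Agreed-Used, then Agreed-Unused, each for Most, Middle, and Least universal).

```python
def chi_square_selective_yates(obtained, expected, threshold=50):
    """Sum of (|O - E| - c)^2 / E over cells, where c is 0.5 for cells
    with an obtained frequency below `threshold` and 0 otherwise.

    Assumption: this mirrors the footnote to Table 3; it is not stated
    for Table 4 in the text.
    """
    total = 0.0
    for o, e in zip(obtained, expected):
        diff = abs(o - e)
        if o < threshold:
            diff = max(diff - 0.5, 0.0)  # Yates's continuity correction
        total += diff ** 2 / e
    return total


# Table 4 cells: Agreed-Used (Most, Middle, Least), then Agreed-Unused.
obtained = [36, 93, 21, 11, 18, 28]
expected = [34.1, 80.4, 35.5, 12.9, 30.6, 13.5]

chi2 = chi_square_selective_yates(obtained, expected)
print(round(chi2, 1))  # 27.0
```

Under this assumption the computed value agrees with the reported chi square of 27.0. Without any continuity correction the same cells give a value of about 29.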

Ambiguity. Ambiguity and use were not correlated. The Pearson coefficient was -.06 ± .069. The correlation between ambiguity and universality was negligible, being .12 ± .069. 

Table 4 (continued) 

                             Amount of Use 
                      Agreed-Unused               Totals 
Universality     Obtained     Expected     Obtained     Expected 
Most                11          12.9          47          47.0 
Middle              18          30.6         111         111.0 
Least               28          13.5          49          49.0 
Totals              57          57.0         207         207.0 


The author examined the statements at both extremes of the ambiguity scale to see on what basis the raters had made their judgments. The most ambiguous statements were (a) heavily loaded with psychoanalytic terminology (“transfer his object,” “introjection,” “oral satisfaction,” “persona,” etc., as well as more generally used jargon such as “denial,” “break with reality,” “affectional needs,” “schizoid tendency,” “inadequate person,” “over-ideational preschizophrenic,” “superego”); (b) contained terminology which does not have a widely known or agreed upon meaning (“His concept of masculinity is to be assertive and not aggressive,” “There are signs of severe sex shock”); or (c) were clumsily worded (“It may be postulated that the mother was moderately frustrating to the subject, yet had some inconsistency”). 

In contrast with the above, the least ambiguous statements contained no psychoanalytic terminology except for a few uses of words referring to mechanisms of defense (“projection,” “intellectualization,” “identify”). Otherwise it was difficult to define objective characteristics of a nonambiguous statement. 


Author-originated statements. The six judges who studied the TAT’s tended to avoid the use of the 43 complementary statements added by the author. Only 14 of them were applied by five judges to at least one record. Table 5 shows the obtained and expected frequencies of statements used less than three times, and the balance. Chi square was 8.48, significant beyond the .01 level. This seemed to offer evidence that the statements added by the author came from a different universe of interpretative statements than those obtained from the other clinical psychologists, so they were excluded from further statistical treatment. 


Table 5 

Author-Originated Statements Used Less Than Three Times versus the Balance 

                                 Obtained     Expected 
Used less than three times          21          11.8 
Balance                             22          31.2 

A systematic qualitative examination of the 
author-originated statements was attempted. It 
appeared possible that the judges were avoiding 
the use of statements referring to positive 
traits or assets of personality. Such avoidance 
did occur in spite of the fact that two of the 
TAT records were from normal subjects and 










should have provided an opportunity for the 
use of the statements. A related observation is 
that the original interpretations contained al- 
most no references to positive traits or assets of 
personality. 


Summary and Conclusions 

Six clinical psychologists studied six TAT 
records taken from a heterogeneous group of 
subjects and decided whether or not each of 
207 typical interpretative statements and 43 
author-originated statements applied to each 
TAT, making possible an evaluation of reli- 
able-discrimination. 

In addition, the statements were rated by 
twenty-six other clinical psychologists accord- 
ing to degree of ambiguity, and later, accord- 
ing to degree of universality. 

1. The most significant observation was the degree to which absence of reliable-discrimination predominated. Reliable-discrimination was significantly better than chance according to an unexacting criterion. However, in no case was an interpretative statement used with complete accord by six judges to discriminate among all six patients. A validation study would have been doomed to failure, and not necessarily because of faulty hypotheses. 

2. Judges tended to apply statements rated as universal to any patient, and to “hedge” or avoid the use of more specific statements. 

3. Qualitative examination suggested at least two factors worthy of further study: (a) the most ambiguous statements were heavily loaded with psychoanalytic terminology or other jargon, while the least ambiguous statements contained few of these references; (b) statements referring to positive traits or assets of personality were absent from the original sample of interpretative statements. When statements of this nature were added by the author, the raters avoided using them. 

The main contribution of the experiment is to suggest a flexible methodology for the study of interpretative statements, and to call attention to the need for further research into the habits of projective test interpreters. 


Received February 18, 1952. 
Early Publication. 


References 


1. Cronbach, L. J. A validation design for qualitative studies of personality. J. consult. Psychol., 1948, 12, 365-374. 

2. Davenport, Beverly F. The semantic validity of TAT interpretations. Unpublished doctor’s dissertation, Univer. Southern California, 1952. 

3. De Groot, A. D. Some preliminary remarks to a methodology of psychological interpretation. On falsification. Acta psychol., 1950, 7, 196-224. 

4. Krugman, M. Review of Rorschach test. In O. K. Buros (Ed.), Third mental measurements yearbook. New Brunswick, N. J.: Rutgers Univer. Press, 1949. Pp. 132-133. 

5. Rotter, J. B. The present status of the Rorschach in clinical and experimental procedures. J. Pers., 1948, 16, 304-311. 

6. Rotter, J. B. Review of Thematic Apperception Test. In O. K. Buros (Ed.), Third mental measurements yearbook. New Brunswick, N. J.: Rutgers Univer. Press, 1949. Pp. 205-206. 

7. Shneidman, Edwin S., with Walther Joel and Kenneth B. Little. Thematic Test Analysis. New York: Grune and Stratton, 1951. 








The Goodenough Draw-A-Man Test and 
Primary Mental Abilities 


H. L. Ansbacher 


University of Vermont 


The Goodenough Draw-a-Man Test [3] 
held the third place among psychometric in- 
struments in frequency of use according to 
total mentions in a survey of psychological 
clinics conducted by Louttit and Browne [8] 
in 1946. It was exceeded only by the Stanford- 
Binet (SB) and the Wechsler-Bellevue (WB). 
In 1935 the Goodenough Test had held 
the twelfth place; the increase in popularity 
seems the more remarkable if one considers 
that many new tests have appeared during this 
interval. Yet in spite of its popularity, the 
Goodenough has been correlated only with 
measures of general intelligence (almost ex- 
clusively the Stanford-Binet), never with any 
factors of intelligence. This statement is based 
on the review by Goodenough and Harris [4, 
pp. 384-388] and was confirmed in a private 
communication from Dr. Harris. The purpose 
of the present study was to provide some 
quantitative basis regarding the relationship 
of the Goodenough to specific aspects of gen- 
eral ability. 


Procedure 


The subjects were 100 fourth-grade chil- 
dren from Burlington, Vermont public 
schools.1 Twenty-five children were taken 
from each of four schools. None of the classes 
had more than 30 pupils; to keep the number 
of subjects down to 25, the most overaged 
children were excused from taking the tests. 
One of the schools was located in a poorer 
neighborhood, one in a mixed, and two in 


1The writer wishes to express his sincere grati- 
tude to Misses Catherine C. Cartier and Mary K. 
McDermott, principals, and Mrs. Erna L. Benedict 
and Misses Jean Carlson, Mary E. Farrell, and 
Ethel P. Symes, teachers, for their very kind and 
generous cooperation. 


better neighborhoods. In one class in the better 
neighborhood the brightest children had been 
promoted to fifth grade prior to our study. 
The mean age of the subjects was 10 years 
with an SD of 7 months. Their mean IQ on 
the Goodenough was 101.8 (SD 14.25, range 
74-131), and on the SRA Primary Mental 
Abilities Test (PMA) the total-score mean 
IQ was 97.95 (SD 12.52, range 69-131). Re- 
garding age, background, and test performance 
the group was more homogeneous than the 
U. S. white population at large, a factor which 
will play a part in the discussion of results. 
As the most convenient instrument for ob- 
taining measures of specific aspects of intelli- 
gence, Thurstone’s Primary Mental Abilities 
Test (PMA) for ages 7-11 [14] was used. 
It provides scores of the following abilities: 


Verbal-Meaning (V), the ability to understand ideas expressed in words; 

Space (S), the ability to think about objects in two or three dimensions; 

Reasoning (R), the ability to solve logical problems, to foresee and plan; 

Perception or Perceptual-speed (P), the ability to locate details rapidly and accurately; 

Number (N), the ability to work with figures, to handle simple quantitative problems rapidly and accurately. 



In addition, a Total Score and a Total 
Nonreading Score can be computed. All 
scores are convertible into quotients, and it is 
these on which our correlations are based. 

Since it is very conceivable that Goodenough Test performance might be related to motor ability, which is one of Thurstone’s primary factors for ages 5-7, the subjects were also given three subtests of the MacQuarrie Test for Mechanical Ability [13] which are most highly loaded with this or similar factors. These are the Tracing, Tapping, and Dotting subtests. The remaining four subtests could be dispensed with because they are most highly loaded with the spatial factor which is tapped by the PMA. The factorial information regarding the MacQuarrie is taken from Harrell’s study [5]. We are aware that a factorial study with 10-year-old children would most likely yield results somewhat different from those obtained by Harrell with cotton mill machine fixers; but we had nothing better to use. Since no norms for our age group are available, we correlated the raw scores of the MacQuarrie subtests with Goodenough IQ’s after having found that the correlation of these raw scores with age, for our group of very limited age range, was zero. 

The tests were administered by the writer on April 2, 3, 4, and 5, 1951, beginning in the morning at 9. The order was MacQuarrie subtests, Goodenough, PMA, with an intermission during the PMA. Total testing time was about two hours. Rapport was excellent throughout; the subjects enjoyed the PMA as much as the two preceding tests. 


Table 1 

Product-moment Correlations of Goodenough IQ’s with PMA Quotients and MacQuarrie Subtests Raw Scores; Together with Means and SD’s of the Different Variables 
(N = 100) 

            1                      2        3          4 
                                                     r with 
     Name of Variable              M        SD     Goodenough IQ 

PMA Quotients 
  Words vocabulary (Vw)           99.3     12.9       .23 
  Pictures vocabulary (Vp)       102.8     17.7       .19 
  Total Verbal-Meaning (V)       100.4     13.9       .26 
  Space (S)                      101.1     22.8       .38 
  Word-grouping (Rw)             103.3     20.1       .28 
  Figure Grouping (Rf)            97.0     16.1       .34 
  Total Reasoning (R)             99.2     15.7       .40 
  Perception (P)                  98.2     16.4       .37 
  Number (N)                      95.6     11.7       .24 
  Total Nonreading                98.3     11.6       .45 
  Total Score                     97.9     12.5       .41 
  S + R + P                       99.7     15.0       .48 

MacQuarrie raw scores 
  Tracing                         22.3      6.1       .34 
  Tapping                         21.8      4.6       .23 
  Dotting                         14.3      2.7       .16 


Results 


In all, fifteen correlation coefficients were calculated, as presented in Table 1. In Column 1 the variables with which the Goodenough was correlated are listed. Columns 2 and 3 show the means and SD’s of these variables. Column 4 presents the product-moment correlations of the variables with the Goodenough scores. The standard errors of the coefficients, calculated by the formula $(1 - r^2)/\sqrt{N - 1}$, are .10 for the two r’s under .20; .09 for the nine r’s from .20 to .38; and .08 for the four r’s of .40 and over. 
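
The standard-error formula quoted above, $(1 - r^2)/\sqrt{N - 1}$, is simple to evaluate directly. The following is a minimal sketch; the function name `standard_error_r` is illustrative, and N = 100 is the sample size given in the Procedure section.

```python
import math


def standard_error_r(r, n):
    """Standard error of a product-moment correlation by the formula
    (1 - r^2) / sqrt(N - 1) quoted in the text."""
    return (1 - r ** 2) / math.sqrt(n - 1)


# A few of the coefficients from Table 1, with N = 100.
for r in (0.16, 0.38, 0.40, 0.48):
    print(f"r = {r:.2f}: SE = {standard_error_r(r, 100):.3f}")
```

The standard error shrinks as the coefficient grows, which is why the larger coefficients in Table 1 carry the smaller standard errors.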


Table 2 

Correlation of Goodenough IQ’s with PMA Quotients and MacQuarrie Raw Scores, in Rank Order 

                                        r with 
Variable                             Goodenough IQ 
Space + Reasoning + Perception           .48 
Total PMA Nonreading                     .45 
Total PMA score                          .41 
Reasoning (R)                            .40 
Space (S)                                .38 
Perception (P)                           .37 
MacQuarrie Tracing                       .34 
Verbal Meaning (V)                       .26 
Number (N)                               .24 
MacQuarrie Tapping                       .23 
MacQuarrie Dotting                       .16 





To make the results more meaningful, the 
coefficients are presented in rank order in Table 
2. The subscores for Verbal-Meaning and 
Reasoning are omitted, thus reducing the 


number of coefficients to eleven. 


Discussion 


Absolute significance of the correlations ob- 
tained. None of the correlation coefficients ob- 
tained are high; even the three highest 
coefficients—those for the complete PMA 
(equivalent to general intelligence) ; for the 
nonreading parts of the PMA; and for the 
mean of Space, Reasoning, and Perception 










factors—are only in the forties. These and 
all our other coefficients must not be taken 
entirely at their face value, however. They 
are all depressed by the fact that our group 
was more homogeneous than the standard- 
ization populations of the PMA and of the 
Goodenough. On the PMA total scores we 
obtained an SD of only 12.5 IQ points, 
whereas the standardization population had 
an SD of 15. While this figure is not stated in 
the Manual, it can be estimated from the 84th 
and 16th percentiles in terms of raw scores 
for the age group. Our group then was about 
17 per cent less variable on the PMA than the 
standardization group. On the Goodenough 
the SD of our group was only 14.25 IQ points 
which is 25-30 per cent less than the SD’s of 
19-21 reported by others [3, 7, 10]. 

While, according to McNemar, “it can be shown that double selection, i.e., curtailment of the range on both variables, tends to depress the correlation coefficient,” “the formulas for ‘correcting’ for double curtailment are not too satisfactory” [12, p. 127]. Thus he prefers not to give any. If the PMA range alone had been curtailed, the three highest coefficients would have been raised about .08 each; if the Goodenough alone, the increase would have been about .10. These are estimates from McNemar’s Table 17, p. 126. 

Thanks to the fact that Goodenough [3, p. 
51] presents the complete scatter diagram for 
her Binet-Goodenough IQ correlation it was 
possible to use her data to calculate the corre- 
lation for a range of 70 to 130 IQ points 
which corresponds to our range of 69-131. 
The correlation with the Binet for the reduced 
range was .63 whereas the correlation for the 
original range of 40 to 170 had been .74 with 
an SD of 21. But even for the reduced range 
of Goodenough the SD was still 16.3 which is 
two points higher than our SD of 14.25. Had 
it been possible to select from the Goodenough 
raw data a group completely comparable to 
ours in respect to low variability, the corre- 
lation coefficient with the 1916 SB would 
have come still closer to the coefficient obtained 
with PMA total-score IQ’s. 

In Table 3 is presented a survey of corre- 
lations obtained by other workers between the 
Goodenough and the SB, and in one instance 
the Cornell-Coxe Performance Test. Only 


Table 3
Survey of Correlations of the Goodenough Test with Measures of General Intelligence,
Presented in Chronological Order

                                   Subjects
Author           N           Description           CA / MA (years)     Measure of general       r with
                                                                       intelligence used        Goodenough
Goodenough [3]   334         normal                4-10 / --           1916 SB IQ               .74
Yepsen [16]      37          defective boys        9-18 / 5-11         1916 SB MA               .60
McElwee [9]      45          subnormal             4-12                1916 SB MA               .72
Earl [2]         113         feebleminded adults   5-9                 1916 SB                  .48
Williams [15]    100         normal                4-12                1916 SB IQ               .65
                                                                       1916 SB MA               .80†
McHugh* [11]     90          normal                Kindergarten / --   1937 SB Forms L & M IQ   .41
                                                                       1937 SB Forms L & M MA   .45†
Havighurst [7]   66          Midwest whites        --                  1937 SB IQ               .50
                                                                       Cornell-Coxe IQ          .63
Birch [1]        68          defectives            10-16 / up to 11    1937 SB IQ               .62
Harris [6]       groups of   University parents    Kindergarten / --   1937 SB MA               .65 to .75
                 23-25

*The first study of correlation with the 1937 SB. “These r’s probably are somewhat lower than will be obtained
with all subjects tested on one form of the Binet test” [11, p. 124].

†Note that where IQ and MA r’s have been computed the latter are higher.








Goodenough Draw-a-Man and Primary Abilities 179 


studies in which the subjects were of the 
chronological or at least the mental ages for 
which the Drawing Test had been standardized
by Goodenough are included. 

We are not trying to determine what influ- 
ences may have raised or lowered the corre- 
lations presented in Table 3, nor do we wish 
to open the discussion of the validity of the 
Goodenough Test. We only wish to state that 
our three highest coefficients (.41, .45, and 
.48), the only ones comparable to measures 
of general intelligence, are still within the 
range of those found by others for tests of 
general intelligence, although our coefficients 
are depressed by the homogeneity of the group. 

Relative significance of correlations obtained.
The relative position of the eleven 
coefficients of Table 2 must be regarded with 
a great deal more confidence than their abso- 
lute values because whatever statistical, ex- 
perimental, or cultural influences may have 
operated, these were the same for all eleven 
instances. 

The total-score r of .41 should be taken as 
our point of reference, this total score coming 
closest to a test of general intelligence. When 
reading items are eliminated, the r is raised to 
.45, which would indicate that the Goodenough
performance is not related to reading 
proficiency. 

The Reasoning subtest of the PMA alone 
correlates .40 with the Goodenough, which is 
almost as high as the correlation of the total 
PMA with the Goodenough. This confirms 
Goodenough’s original belief “that the test 
throws considerable light on the development 
of conceptual thinking in young children” [3, 
p. 82], the PMA Reasoning being a subtest of 
concept formation. 

The next highest subtest r’s are .38 for 
Space, the ability to think about objects in two 
or three dimensions, and .37 for Perception, 
the ability to locate details rapidly and accurately.
The combined score for Reasoning, 
Space, and Perception yields the highest r of 
our study, .48. This would be a confirmation 
of Havighurst’s hypothesis as to what the 
Goodenough measures, namely, “the ability to 
form concepts based upon observation” [7, p. 
59], an elaboration of Goodenough’s original 
hypothesis. 


By contrast, the Goodenough performance 
draws only slightly on the ability to under- 
stand ideas expressed in words as shown by the 
r of .26 with Verbal Meaning; and still less 
on the ability to work with figures, as shown 
by the r of .24 with Number. 

The r’s of the Goodenough with the MacQuarrie
items lead to the following considerations,
assuming that the factor loadings of 
the MacQuarrie items in 10-year-old children 
bear some similarity to those reported by 
Harrell [5] for adults. The lowest r in 
Table 2, .16, is the one for the MacQuarrie 
Dotting Test, which requires making dots 
precisely. This item showed as high a loading
with “agility,” or dexterity, or coordination,
as any of the mechanical ability tests 
included in Harrell’s study, while at the same 
time the contribution of other factors was 
negligible. This then is a confirmation of 
another of Goodenough’s contentions, namely, 
that in the drawing test “the importance of 
eye-hand coordination is greatly lessened” [3, 
p. 72]. 

Next lowest is the r of .23 for the Mac- 
Quarrie Tapping Test, which requires making 
dots rapidly. Interestingly enough, the highest 
loading of this test is on Harrell’s verbal factor,
the second highest on “agility,” and the r 
obtained here comes close to the r of .26 obtained
for Thurstone’s Verbal-Meaning 
factor. 

Among the MacQuarrie items, Tracing 
correlated highest with the Goodenough, the 
r of .34 coming close to the r of .37 obtained 
for Thurstone’s Perception factor. This test 
requires the tracing of a narrow path and 
shows the highest loading with Harrell’s factor
of “youth,” the next highest loading with 
perception. “Youth” is described as “youthful
willingness to follow directions.” We 
should like to underline “willingness” as the 
most interesting part of this description at this 
point, it being a personality characteristic rath- 
er than a special ability. This then is new evi- 
dence that the Draw-a-Man Test is related to 
personality factors, a fact of which Good- 
enough herself was well aware [3, p. 80], and 
which since has come into its own through 
the development of drawing-a-person as a projective
technique with an entire literature of 
its own. 


Summary 


In a study correlating the Goodenough 
Draw-a-Man Test with factors of general 
ability, the elementary form of Thurstone’s 
Primary Mental Abilities Test was used and 
the Tracing, Tapping, and Dotting subtests 
of the MacQuarrie. The subjects were a more 
than average homogeneous group of 100 
fourth-grade children. The results may be 
summarized as follows: 

Leaving open the question of the absolute 
extent to which the Goodenough measures 
general intelligence or factors of intelligence, 
it may be stated regarding the relative degree 
to which the Goodenough taps various special 
abilities: 



1. That it is most highly correlated with the 
factors of Reasoning (r = .40), Space (r = 
.38), and Perception (r = .37), which confirms
similar hypotheses by Goodenough, and 
especially by Havighurst who stated that the 
drawing test measures “the ability to form 
concepts based upon observation.” 

2. That it has little in common with the 
Verbal Meaning (r = .26) and the Number 
(r = .24) factors. 

3. That it has least in common with the 
MacQuarrie Tapping (r = .23) and Dotting
(r = .16), primarily tests of manual dexterity,
or simple coordination. 

4. That it is correlated with Tracing (r= 
.34), a test loaded with “youthful willingness 
to follow directions,” almost as highly as with 
Perception, “willingness” being an attribute of 
personality. This can be taken as new evi- 
dence that Goodenough performance is re- 
lated to personality. 


Received October 9, 1951. 


References

1. Birch, J. W. The Goodenough Drawing Test 
and older mentally retarded children. Amer. 
J. ment. Def., 1949, 54, 218-224. 

2. Earl, C. J. C. The human figure drawings of 
adult defectives. J. ment. Sci., 1933, 79, 305-328. 

3. Goodenough, Florence L. Measurement of
intelligence by drawings. Yonkers, N. Y.: World 
Book Co., 1926. 

4. Goodenough, Florence L., & Harris, D. B. 
Studies in the psychology of children’s drawings:
II. 1928-1949. Psychol. Bull., 1950, 47, 
369-433. 

5. Harrell, T. W. A factor analysis of mechanical
ability tests. Psychometrika, 1940, 5, 17-33. 

6. Harris, D. B. Personal communication. 

7. Havighurst, R. J., Gunther, N. K., & Pratt, I. 
E. Environment and the Draw-a-Man Test: 
The performance of Indian children. J. abnorm.
soc. Psychol., 1946, 41, 50-63. 

8. Louttit, C. M., & Browne, C. G. The use of 
psychometric instruments in psychological clinics.
J. consult. Psychol., 1947, 11, 49-54. 

9. McElwee, E. W. The reliability of the Goodenough
intelligence test used with subnormal 
children fourteen years of age. J. appl. Psychol.,
1932, 16, 217-218. 

10. McHugh, G. Changes in Goodenough IQ at 
the public school kindergarten level. J. educ. 
Psychol., 1945, 36, 17-30. 

11. McHugh, G. Relationship between the Goodenough
Drawing a Man Test and the 1937 
Revision of the Stanford-Binet Test. J. educ. 
Psychol., 1945, 36, 119-124. 

12. McNemar, Q. Psychological statistics. New 
York: Wiley, 1949. 

13. MacQuarrie, T. W. MacQuarrie Test of Mechanical
Ability. Los Angeles, Cal.: California
Test Bureau, 1925. 

14. Thurstone, L. L., & Thurstone, T. G. Examiner
manual for the SRA Primary Mental Abilities,
Elementary, for ages 7 to 11. (Preliminary
ed.) Chicago: Science Research Associates,
1948. 

15. Williams, J. H. Validity and reliability of the 
Goodenough intelligence test. Sch. & Soc., 
1935, 41, 653-656. 

16. Yepsen, L. N. The reliability of the Goodenough
Drawing Test with feeble-minded subjects.
J. educ. Psychol., 1929, 20, 448-451. 











The CVS Abbreviated Individual Intelligence Scale¹ 


William A. Hunt and Elizabeth G. French² 


Northwestern University 


For the past few years the authors and their 
colleagues have been engaged in the develop- 
ment and evaluation of brief individual in- 
telligence scales [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11]. Our goal has been a brief scale, easily 
administered, that would offer an adequate 
measure of intelligence at the same time that 
it was serving as a rough diagnostic screen. 
The most promising of the scales we have used 
is CVS consisting of Comprehension (C) and 
Similarities (S) from the Wechsler-Bellevue 
scale and a 15-word vocabulary test (V) 
adapted by Thorndike from the Stanford- 
Binet. We have already published results from 
the original standardization group of 528 
Naval recruits and a cross-validational group 
of 511 recruits, plus several clinical samplings 
[4, 5, 7]. Further work now has confirmed 
the satisfactory performance of CVS and ex- 
tended our normative groups to a total of over 
2,000 cases. In view of the continuing stability 
of CVS in use, we are now offering final 
norms based upon our total findings to date. 


In suggesting CVS as an adequate scale for 
clinical use, we should like to stress that our 
definition of “adequate” is an operational one 
and is to be found in the results reported be- 
low. Here is how CVS performs. Compared 
to longer tests some slight shrinkage in reli- 
ability and validity is inevitable owing to the 
brevity of the scale. Its clinical screening performance
is far from perfect, but compares favorably
with other screening devices. For the 
purposes for which it was intended, we consider
CVS adequate. As with any other test, 
the prospective user must draw his own conclusions
after examining the validational data. 


¹ This study is part of a larger project subsidized 
by the Office of Naval Research under their policy 
of encouraging basic research. The opinions expressed,
however, are those of the individual authors
and do not represent the opinions or policy of 
the Naval service. 

² Currently with the Personnel Research Laboratory,
HRRC, Lackland Air Force Base, San Antonio,
Texas. 


CVS as a Measure of Intelligence 


CVS now has been administered individ- 
ually to a total of 1,649 Naval recruits be- 
tween the ages of 17 and 25. The majority of 
these subjects were 17 and 18, the mean being 
17.6. Their educational level ranged from 6 to 
15 grades completed with a mean of 10.5. 
These men have all been given the Navy Gen- 
eral Classification Test. The GCT scores for 
the group ranged from 22 to 76 with a mean 
of 53.3 and a standard deviation of 10.6. This 
compares favorably with the original Navy 
mean of 50 and standard deviation of 10. The 
distribution was slightly skewed in a negative 
direction as might be expected from the fact 
that our mean is a little higher than the mean 
of 50 obtained by the Navy on a wartime 
population. In general, however, this group 
may be assumed to be fairly representative of 
the male population within this age range. 


Table 1 
CVS Data (Standard Scores) for 1,649 
Naval Recruits 

          CVS      C       V       S
Range     11-51    3-17    3-18    3-17 
Mean      30.9     10.6    9.9     10.4 
Sigma     6.9      2.5     2.7     2.7 
r GCT     .80      .58     .69     .65 
r C       --       --      .47     .56 
r V       --       --      --      .55 





A summary of the validational data on this 
group of 1,649 cases is presented in Table 1. 










The data are presented in terms of standard 
scores. Wechsler’s standard scores were used 
for Comprehension (C) and Similarities (S) 
and the standard scores for Vocabulary (V) 
were calculated by us using Wechsler’s tech- 
nique [5]. The total score for CVS as a 
whole is obtained by adding the standard 
scores of the three subtests. This has seemed 
justified since multiple correlations between 
the GCT scores and CVS subtest scores have 
always approximated closely the Pearson r ob- 
tained by using a total CVS score in which 
each test was weighted equally. 

The CVS total scores ranged from 11 to 
51 with a mean of 30.9 and a standard devi- 
ation of 6.9. The distribution is normal ac- 
cording to the chi-square test of goodness of 
fit. The correlation of CVS with the criteri- 
on, GCT, is .80. The subtest means for C, V, 
S respectively were 10.6, 9.9, and 10.4, with 
sigmas of 2.5, 2.7, and 2.7. The subtest corre- 
lations with GCT were .58 for C, .69 for V, 
and .65 for S. C and V showed the lowest 
intercorrelation, .47, while S correlated .56 
with C and .55 with V. 
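
The relation between the subtest statistics and the validity of an equally weighted total can be sketched with the standard formula for the correlation of a sum with a criterion: the sum of each subtest's SD times its criterion correlation, divided by the SD of the sum. The Python sketch below uses only the rounded figures reported here; the variable and function names are hypothetical. Because the inputs are rounded, the result should be expected to approximate, not reproduce, the reported sigma of 6.9 and r of .80.

```python
import math

# Rounded subtest statistics as reported for the 1,649 recruits.
sd = {"C": 2.5, "V": 2.7, "S": 2.7}
r_gct = {"C": 0.58, "V": 0.69, "S": 0.65}
r_between = {("C", "V"): 0.47, ("C", "S"): 0.56, ("V", "S"): 0.55}


def intercorrelation(a: str, b: str) -> float:
    """Correlation between two subtests (1.0 for a subtest with itself)."""
    if a == b:
        return 1.0
    return r_between.get((a, b), r_between.get((b, a)))


def composite_sd(tests) -> float:
    """SD of the unweighted sum of the given subtests."""
    variance = sum(sd[a] * sd[b] * intercorrelation(a, b)
                   for a in tests for b in tests)
    return math.sqrt(variance)


def composite_validity(tests) -> float:
    """Correlation of the unweighted sum with the criterion (GCT)."""
    covariance = sum(sd[t] * r_gct[t] for t in tests)
    return covariance / composite_sd(tests)


tests = ("C", "V", "S")
print(f"SD of total: {composite_sd(tests):.2f}")
print(f"r of total with GCT: {composite_validity(tests):.2f}")
```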
Table 2 

CVS Norms (Standard Scores) in Sigma Units 
for 1,649 Naval Recruits 

Sigma     CVS Total     Sigma     CVS Total 
Units     Scores        Units     Scores 
+3.0      51.6          -0.5      27.5 
+2.5      48.2          -1.0      24.0 
+2.0      44.7          -1.5      20.6 
+1.5      41.3          -2.0      17.1 
+1.0      37.8          -2.5      13.8 
+0.5      34.4          -3.0      10.2 
Mean      30.9 





Norms for the total CVS score are given 
in Table 2 based on our 1,649 cases. We are 
presenting these in terms of sigma units as we 
feel these to be most satisfactory. They may 
be translated into such other equivalent scores 
(IQ, etc.) as the clinician may judge suitable. 
In using these norms the clinician should keep 
in mind that while the size of the sampling is 
large, the age range is limited. 
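
A conversion of this kind can be sketched in Python from the reported mean of 30.9 and sigma of 6.9. The function names are hypothetical, and the deviation-IQ scale with mean 100 and SD 15 is an assumption chosen for illustration; the text leaves the choice of equivalent score to the clinician.

```python
CVS_MEAN = 30.9   # mean total CVS score for the 1,649 recruits
CVS_SIGMA = 6.9   # standard deviation of the total CVS score


def to_sigma_units(total_score: float) -> float:
    """Express a total CVS score in sigma units from the recruit mean."""
    return (total_score - CVS_MEAN) / CVS_SIGMA


def from_sigma_units(z: float) -> float:
    """Total CVS score corresponding to a given number of sigma units."""
    return CVS_MEAN + z * CVS_SIGMA


def to_deviation_iq(total_score: float, iq_mean: float = 100.0,
                    iq_sd: float = 15.0) -> float:
    """Map a total CVS score to an assumed IQ-like scale (mean 100, SD 15)."""
    return iq_mean + iq_sd * to_sigma_units(total_score)


for z in (2.0, 1.0, 0.0, -1.0, -2.0):
    score = from_sigma_units(z)
    print(f"{z:+.1f} sigma -> CVS {score:.1f} -> IQ-equivalent "
          f"{to_deviation_iq(score):.0f}")
```

Any such translation inherits the limits of the norms themselves, in particular the restricted age range of the sample.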


CVS as a Screen Test for 
Mental Deficiency 


One of the uses of such a brief intelligence 
scale is screening possible mental defectives for 
intensive testing. Mental deficiency is a serious 
diagnosis with important social consequences 
for the individual concerned, and should never 
be made on the basis of an abbreviated test 
alone. The brief test, however, can be used 
with profit to select those individuals who are 
in need of more careful examination. In all 
we have now tested 167 institutionalized men- 
tal defectives of the simple familial type, un- 
complicated by added organic or emotional 
factors. Selection was made on the basis of 
unanimous staff diagnosis at the institution in- 
volved. Their ages ranged from 17 to 26 with 
a mean of 19.9 and a standard deviation of 
2.8. CVS scores ranged from 2 to 25 with a 
mean of 12.1 and a standard deviation of 5.0. 
The group represents the “higher” type of 
institutionalized mental defective and includes 
many of the borderline cases that are difficult 
to detect in clinical practice. We previously 
have suggested the use of a cut-off point to 
indicate possible mental deficiency and the de- 
sirability of further testing [4, 5]. The selec- 
tion of a cut-off point is an arbitrary matter 
depending upon the demands of the screening 
situation. If one uses as a cut-off point the 
score of 24 or one sigma below the mean, one 
would pick up 99 per cent of the mental defectives
and only 13 per cent of the group of 
1,649 normal recruits. This would insure de- 
tecting most of the mental defectives, but 
would include a fairly high group of normals. 
If one desired to exclude more normals, a 
score of 17 or two sigma below the mean would 
detect 85 per cent of the mental defectives and 
include only 2 per cent of the normals. CVS 
thus seems a suitable screening device for iso- 
lating possible mental defectives for further 
study. 
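
The trade-off between the two cut-off points can be approximated from the reported means and standard deviations alone. The Python sketch below assumes, purely for illustration, that total scores are normally distributed in both groups; the percentages quoted above are empirical counts, so the sketch should only come close to them. The function and dictionary names are hypothetical.

```python
import math


def normal_cdf(x: float, mean: float, sd: float) -> float:
    """Proportion of a normal distribution falling at or below x."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))


# (mean, SD) of total CVS scores as reported in the text.
GROUPS = {
    "defectives": (12.1, 5.0),
    "recruits": (30.9, 6.9),
}

for cutoff in (24, 17):
    detected = normal_cdf(cutoff, *GROUPS["defectives"])
    flagged = normal_cdf(cutoff, *GROUPS["recruits"])
    print(f"cut-off {cutoff}: about {detected:.0%} of defectives detected, "
          f"about {flagged:.0%} of recruits included")
```

Lowering the cut-off trades missed defectives for fewer normals sent on to intensive testing, which is why the choice depends on the demands of the screening situation.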


CVS as a Screen Test for Indicating 
Possible Psychopathosis 


In constructing CVS we designed it not 
only as a test for intelligence but also as a 
screen for possible psychopathosis. Herein we 
feel lies one of its advantages over the 
numerous similar brief test batteries that we 
and others have proposed. It was not antici- 
pated that it would offer any final diagnostic 
criteria, but merely that it might be used as a 
preliminary screen to sift out individuals who 




might benefit from further, intensive testing. 
To achieve this we included two tests, Comprehension
and Similarities, which are known to 
be sensitive to pathosis, and Vocabulary, which 
is relatively resistant. The difference between 
Vocabulary scores and scores for other abilities 
has been a classic approach to the problem of 
diagnostic scatter measures and we felt that 
in the relatively simple setting of our brief 
battery it might prove even more efficacious 
than in the complexity of numerous subtests 
that mark the longer intelligence scales. 

Our clinical groups included 145 institutionalized
schizophrenics ranging in age 
from 16 to 43 with a mean of 28.1 and sigma 
of 6.3, 73 organic cases (50 paretics and 23 
arteriosclerotics) ranging from 35 to 71 with 
a mean of 55.6 and sigma of 12.3, and the 167 
mental defectives described in the first part 
of this paper. These include cases previously 
reported on [4, 5, 9] plus some new ones. 
Since many of our clinical cases were much 
older than the Naval recruits, it was necessary 
to add further control groups from a normal 
population of comparable age. To our basic 
control group of 1,649 normal recruits we 
therefore added two groups of older normal 
subjects, one consisting of male and female 
subjects ranging in age from 25 to 55 with a 
mean of 38.1 and sigma of 8.9, the other male 
and female subjects from 56 to 88 with a 
mean of 66.4 and sigma of 8.6. 


Table 3 

Scatter Data for All Groups 

                                                  Scatter Score
                               % Scattering       (C + S)/2 - V
Group                  N       (V > C & S)        M         SD 
Recruits               1649    21                 +.32      2.30 
Normals (25-55 yrs.)   125     52                 -1.67     2.56 
Normals (56-88 yrs.)   147     82                 -3.33     2.55 
Schizophrenic          145     58                 -2.14     3.96 
Organic                73      88                 -5.04     2.82 
Defective              167     11                 +1.05     1.82 





The results are summarized in Table 3. 
Originally, when we were dealing largely with 
young adults, we proposed as our criterion for 
scatter the appearance of a Vocabulary score 
higher than the scores on both Comprehension 
and Similarities (V > C and S). This was done 
on the grounds that the simpler the criterion 
and the less calculation necessary in using it, 
the more practical it would be for the busy 
clinician. As Table 3 demonstrates, this meas- 
ure differentiates significantly between our 
young adults (recruits) and the pathological 
groups (mental defectives excluded). Our 
test would thus appear serviceable as a screen 
for possible psychopathosis although its 
“roughness” is indicated by the appearance of 
a false positive group of 21 per cent among the 
normals. With 21 per cent of our normals 
showing scatter the measure remains indica- 
tive rather than finally diagnostic. A further 
study [1] has shown that this scatter in the 
normal group is positively related to intelli- 
gence (r = .35). While no numerical correc- 
tion is possible in our scatter formulae as the 
result of this finding, any clinician using them 
should keep in mind that they become less val- 
id as the intelligence level of the subject being 
tested rises. 

When we deal with older subjects (as we 
tend to when we test organic patients) our 
simple measure, V > C and S, is no longer 
serviceable since scatter normally increases 
with age as Table 3 shows. With this simple 
measure as a criterion, 52 per cent of our 
middle-aged (25-55) group and 82 per cent 
of our older (56-88) group show scatter. As 
Matarazzo has shown, however, it is possible 
to maintain the differentiating ability of scatter
on CVS if one substitutes for the simple 
scatter measure a measure which takes account
of the amount of scatter rather than the 
mere presence of scatter [9]. We are suggesting
therefore that with subjects of middle 
age and above the measure (C + S)/2 - V be 
used. This offers quantitative scores with scatter
appearing as a minus value. As Table 3 
again indicates, this re-establishes satisfactory 
differentiation between our older normals 
and our clinical groups. While both tend to 
show scatter, the clinical groups show the 
greater amount. It should be possible to es- 
tablish age norms for scatter to extend the 
diagnostic possibilities of CVS, although the 
present data are not extensive enough for this 
purpose. 
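
The two scatter measures discussed above, the simple criterion (Vocabulary higher than both Comprehension and Similarities) and the quantitative score (C + S)/2 - V, can be sketched in Python as follows. The class name, method names, and the sample record are hypothetical and serve only to illustrate the two definitions.

```python
from dataclasses import dataclass


@dataclass
class CVSRecord:
    """Standard scores on the three CVS subtests for one subject."""
    comprehension: float
    vocabulary: float
    similarities: float

    def shows_simple_scatter(self) -> bool:
        """Simple criterion: V exceeds both C and S."""
        return (self.vocabulary > self.comprehension
                and self.vocabulary > self.similarities)

    def scatter_score(self) -> float:
        """Quantitative measure (C + S)/2 - V; scatter appears as a minus value."""
        return (self.comprehension + self.similarities) / 2.0 - self.vocabulary


# Hypothetical record, not taken from the study's data.
record = CVSRecord(comprehension=8, vocabulary=12, similarities=9)
print(record.shows_simple_scatter(), record.scatter_score())
```

The simple criterion yields only a yes or no, while the quantitative score preserves the amount of scatter, which is what allows older normals to be compared with the clinical groups.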

A word concerning the relative lack of 
scatter among mental defectives is in order. 
Our sample of mental defectives was selected 
on the basis of simple familial mental defi- 
ciency uncomplicated by organic or emotional 
involvements. When this criterion is adhered 
to we have found consistently that our mental 
defectives show less scatter than normals of 
the same age. If we add feebleminded cases in- 
volving organic pathosis or emotional prob- 
lems, this difference disappears indicating that 
the appearance of scatter among groups of 
mentally defective subjects is positively related 
to the inclusion of organic and functional 
cases. This is further confirmation of the relation
of scatter (as we have defined it) to 
psychopathosis. 


A further desirable characteristic of any in- 
dividual intelligence test for clinical use is that 
its items should possess clinical richness; that 
is, they should offer clinical cues which can 
serve as a basis for qualitative diagnostic in- 
terpretation by the examiner. In a study by 
Klehr [8] a group of experienced clinicians 
were asked to examine the CVS responses of 
a group of normals, schizophrenics, and men- 
tal defectives and to make a differential diag- 
nosis purely on the basis of the qualitative na- 
ture of the responses without recourse to any 
objective scoring. That they were able to rec- 
ognize the schizophrenic and mental defective 
records at a level significantly above chance 
expectancy shows that even an abbreviated 
battery such as CVS can offer a basis for valid 
clinical judgments of a qualitative sort. 

We have conducted one more “clinical” 
study, previously unreported, involving the 
testing with CVS of 111 Naval disciplinary 
cases confined to the brig at the Great Lakes 
Naval Training Center. The work arose from 
our interest in devising some diagnostic test 
for psychopathic personality of the asocial type. 
As anyone working in the field will realize, it 
is very difficult to secure diagnosed “psychopaths”
in any large number for experimental 
purposes, so we have been forced to use dis- 
ciplinary cases as a substitute under the hy- 
pothesis that such groups contain a larger 
loading of psychopaths than does a normal 
sampling of the population. In the case of our 
Naval disciplinary group we have incomplete 
and fragmentary psychiatric data indicating 
that this hypothesis is correct. Our original in- 
terest in using CVS with psychopaths sprang 
from an unpublished study by Dorothy Rass- 
mussen involving the diagnostic possibilities of 
the McGill Verbal Situation Test [6] when 
used to differentiate a normal sampling from 
a group of Municipal Court cases. The total 
CVS score was used as a means of matching 
the control and experimental groups for in- 
telligence. No differences appeared between the 
groups on the Verbal Situation Test but care- 
ful inspection of our CVS data indicated that 
something peculiar was happening to the sub- 
tests in the CVS battery. Further exploratory 
work of ours with scattered disciplinary cases 
and a few hospitalized psychopaths indicated 
that two things might be occurring. In the first 
place, there seemed to be a tendency for the 
scores on Comprehension to be lower than on 
Similarities and Vocabulary. This trend ap- 
peared to be attributable to a tendency on the 
part of the disciplinary cases to give bizarre, 
“asocial” answers to questions 1, 3, 4, and 8. 
We received answers such as “read the damn 
thing” in response to the question on what to 
do if one finds a stamped, addressed envelope, 
and “you tell me” in response to the query 
concerning why we pay taxes. These occurred 
only rarely among our normal population. In 
the second place, when the order of presen- 
tation of the subtests was varied, a tendency 
appeared for performance on the test pre- 
sented first to be lower than when the same 
test was presented later in the order. This 
we assume to be attributable to the difficulty 
of establishing rapport in testing psychopathic 
personalities. It is possible that rapport is poor 
at the beginning of testing but improves as 
the testing session develops. 

These expected tendencies appeared in our 
111 Naval disciplinary cases. The score on C 
tended to be lower than V contrary to our 
normal sampling, and this difference was ac- 
centuated, making C lower than both V and 
S, when C was placed first in the order of 
presentation. Unfortunately the differences, 
while confirming our hypothesis, were not sta- 
tistically significant. They are, however, sug- 
gestive and seem worthy of further investiga- 
tion. One necessary step would be to refine 
the experimental group by using as a criterion, 
not disciplinary infraction, but an established 
psychiatric diagnosis of psychopathic personality.
To date we have not been able to locate 
such available cases in the large numbers demanded
by the problems of test validation. It 
might also be worth while to remodel Comprehension,
including new items encouraging answers
of an asocial sort. 

In all our normal samples the order of the 
means of the subtests is C, S, V; C being the 
highest. These differences are not statistically 
significant, but have not varied for eight successive
samples. In our Naval disciplinary 
group, on the contrary, the order was V, C, 
S. Our disciplinary group also showed a considerable
amount of scatter (V > C and S). 
Since the disciplinary group seemed of lower 
average intelligence than our normal control 
group of 1,649 cases (with a mean total 
CVS score of 25.3 for the disciplinary cases 
compared to 30.9 for the normals) and since 
we had already demonstrated a relation between
scatter and intelligence and scatter and 
age, some check seemed necessary with intelligence
and age controlled. Unfortunately 
Navy GCT scores were not available for the 
disciplinary cases. Drawing upon our files, 
however, it was possible to match 74 of the 
cases for age and total CVS score with 74 
schizophrenics and 74 normal recruits. The 
mean age of the three groups was 22.3 and 
the mean total CVS score 26.5. Table 4 shows 


Table 4 
CVS Data for Schizophrenics, Naval Disciplinary 
Cases, and Normal Recruits Matched for 
Age and Total CVS Score 

                         C             V             S
Group            N     M     SD     M      SD     M     SD    % Scatter 
Schizophrenic    74    8.1   2.7    10.1   3.3    8.4   2.7   51 
Disciplinary     74    8.9   2.6    514    S23    BS    2     21 
Recruit          74    9.7   2.7    8.0    2.7    8.6   2.6   10 


the results. The order of difficulty for the recruits
is C, S, V; for the schizophrenics V, 
S, C; and for the disciplinary group V, C, S. 
This is in accord with our previous findings. 
In terms of the percentage showing scatter 
as we have defined it the schizophrenics give 
51 per cent, the disciplinary cases 21, and the 
controls 10. This places the disciplinary cases 
midway between the controls and the schizophrenics
and again suggests the appearance of 
some atypical picture among the disciplinary 
cases. There would seem at least a possibility 
that further investigation and refinement 
might lead to the discovery of test performance
differences at a diagnostic level. The low 
per cent (10) of scatter among these normal 
controls as compared to the total recruit group 
of 1,649 cases (21 per cent) deserves comment.
This small group is of lower intelligence 
(mean CVS score of 26.5) than the larger 
group of 1,649 cases (mean CVS score of 
30.9). Since we have demonstrated a positive 
relationship between intelligence and scatter 
[1] this difference is to be expected. 


Further Findings on CVS 


There remain to be mentioned a few other 
characteristics of the CVS battery. While we 
do not have sufficient numbers in our older 
population to make possible adequate age 
norms at the older levels, CVS shows the expected
performance decrement with advancing 
age. Moreover, as would be expected, scatter 
is seen to be a function of age. Both in terms 
of the mere presence of scatter and of the 
amount of scatter exhibited there is an increase 
as we progress from our recruit group through 
the middle-aged group to the elderly. 

The problem of reliability is a bothersome 
one. Split-half measures are unsatisfactory on 
such a brief test; and a test-retest approach, 
because of the memory factor on such a short 
scale, demands a longer interval between testings
than we could obtain with our recruits, 
who were at the training station only for a 
relatively short time. We have therefore relied 
on the indirect implication of reliability implicit
in repeated validity. It was possible, however,
to retest 116 of our mental defectives 
after a period of approximately one year. The 
correlation obtained was .81. In view of the 
relatively small range of scores within this limited
group we find this encouraging and indicative
of satisfactory reliability. 

Finally, Pollaczek recently has demonstrated
that CVS can be keyed for the detection of 
malingering [11]. Her study showed that it 
was possible to develop a key that would detect
roughly 90 per cent of a group of experimental
malingerers while falsely identifying as 
malingerers only 10 per cent of a genuinely 
feebleminded control group. This performance 
is acceptable for a screening test, and adds a 
valuable attribute to CVS when it is used in 
situations where malingering is an important 
possibility to be guarded against. 


Received August 2, 1951. 


References 


1. French, Elizabeth G., & Hunt, W. A. The relationship
of scatter in test performance to intelligence
level. J. clin. Psychol., 1951, 7, 95-98. 

2. Hunt, W. A., & French, Elizabeth G. A second
fifteen-word vocabulary test for use with 
abbreviated intelligence scales. J. consult. Psychol.,
1949, 13, 124-126. 

3. Hunt, W. A., & French, Elizabeth G. Some 
abbreviated individual intelligence scales containing
non-verbal items. J. consult. Psychol., 
1949, 13, 119-123. 

4. Hunt, W. A., French, Elizabeth G., Klebanoff, 
S. G., Mensh, I. N., & Williams, M. The 
clinical possibilities of an abbreviated individual
intelligence test. J. consult. Psychol., 
1948, 12, 171-173. 

5. Hunt, W. A., French, Elizabeth G., Klebanoff, 
S. G., Mensh, I. N., & Williams, M. Further 
standardization of the CVS individual intelligence
scale. J. consult. Psychol., 1948, 12, 355-359. 

6. Hunt, W. A., French, Elizabeth G., & Mensh, 
I. N. An abbreviated form of the McGill verbal
situation test. J. clin. Psychol., 1948, 4, 
406-411. 

7. Hunt, W. A., Klebanoff, S. G., Mensh, I. N., 
& Williams, M. The validity of some abbreviated
individual intelligence scales. J. consult. 
Psychol., 1948, 12, 48-52. 

8. Klehr, H. Clinical intuition and test scores as 
a basis for diagnosis. J. consult. Psychol., 1949, 
13, 34-38. 

9. Matarazzo, J. D. A study of the diagnostic 
possibilities of the CVS with a group of organic
cases. J. clin. Psychol., 1950, 6, 337-343. 

10. Mensh, I. N. The effects of test abbreviation 
upon responses to the individual items. J. clin. 
Psychol., 1949, 5, 22-36. 

11. Pollaczek, P. P. A study of malingering on the 
CVS abbreviated individual intelligence scale. 
J. clin. Psychol., 1952, 8, 75-81. 








Psychological Services and Professional Problems 
in the Field of Mental Deficiency 


Bernard L. Bloom 


Mansfield State Training School and Hospital 


Descriptive statistics published by the Fed- 
eral Security Agency [3] indicate that at the 
conclusion of 1948 there were 121,426 resi- 
dent patients reported by state institutions for 
the care of mental defectives and epileptics. 
This figure represented a 2.3 per cent increase 
over the preceding year. Were this rate of 
increase to be maintained, at the end of 1950 
there would have been approximately 127,000 
patients at such institutions. 
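
The projection in the preceding paragraph is a compound-growth extrapolation. As a rough check, the following Python sketch carries the 1948 count forward two years at the reported 2.3 per cent annual increase. The function name `project_population` is illustrative only and does not come from the report; constant growth is the report's own assumption.

```python
def project_population(count, annual_rate, years):
    """Compound a starting count forward at a constant annual rate."""
    return count * (1 + annual_rate) ** years


# Figures quoted in the text: 121,426 resident patients at the end of 1948,
# increasing by 2.3 per cent per year.
projected_1950 = project_population(121_426, 0.023, years=2)
print(round(projected_1950))  # close to the "approximately 127,000" in the text
```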


Since most institutions employ clinical
psychologists, several hundred individuals are
actively engaged in, or have professional interest
in, the problems pertaining specifically to the
psychological care of the mental defectives.
The 1950 membership list of the American 
Association on Mental Deficiency [4] records 
332 psychologists in that organization, a fig- 
ure representing 20 per cent of the total mem- 
bership. With nearly one hundred institutions 
located at various points in the United States, 
and with several hundred psychologists directly 
or indirectly connected with them, it would be 
of marked professional interest to assess the 
activities and needs of the departments of
psychology at these institutions. Inter-institutional
communication is often inefficient, and it is
difficult to achieve an overview of the activities
of so large and so geographically dispersed a
group of individuals.

The present survey was designed to yield in- 
formation concerning aspects of the problem 
of mental deficiency which would prove of 
equal interest, relevance, and meaning to psy- 
chologists and to the administrators who share 
in the formulation of policies and activities of 
their departments of psychology. Three general
areas were investigated: (a) present activities
of departments of psychology, (b) needs of
departments of psychology not now being met,
and (c) nature of additional services which
could be offered were such present needs
fulfilled.

A questionnaire and letter were distributed 
to 79 institutions. The names of these institu- 
tions were obtained from the 1950 register of 
the AAMD [4] and included all state insti- 
tutions having patient populations limited to 
mental defectives, epileptics, or both. Ques- 
tionnaires were not sent to state institutions 
which handled the mentally ill in addition to 
mental defectives, since an attempt was made 
to derive information regarding psychological 
activities specifically organized for institution- 


Table 1
Questionnaires Distributed and Returned as a
Function of Geographical Area

Area                                      N   Returns      %

New England                               4      4      100.0
  Maine, N.H., Vt., Mass., R.I., Conn.

Pacific                                   5      5      100.0
  Wash., Ore., Calif.

Middle Atlantic                          14     13       92.9
  N.Y., N.J., Penn.

Mountain                                  7      6       85.7
  Mont., Idaho, Wyo., Colo.,
  N.M., Ariz., Utah, Nev.

West South Central                        4      3       75.0
  Ark., La., Okla., Tex.

West North Central                       10      7       70.0
  Minn., Iowa, Mo., N.D.,
  S.D., Neb., Kans.

East North Central                       14      9       64.3
  Ohio, Ind., Ill., Mich., Wisc.

South Atlantic                           12      7       58.3
  Del., Md., D.C., Va.,
  W. Va., N.C., S.C., Ga., Fla.

East South Central                        4      1       25.0
  Ky., Tenn., Ala., Miss.

Total                                    79     60       75.9













alized mental defectives. Seven weeks after the 
initial questionnaire was mailed, another copy 
of the questionnaire was sent, together with an 
explanatory note, to institutions which had not 
earlier responded. No attempt has been made 
to distinguish, in the analysis of the data, be- 
tween institutions which responded to the 
initial questionnaire and those which responded 
after the follow-up letter. 

Returns were obtained from 60 institutions, 
or 76 per cent of the sample. The total patient
population of the responding institutions was 
97,000 representing 76 per cent of the esti- 
mated total national population of institution- 
alized mental defectives. The geographical dis- 
tribution of the initial sample and the ques- 
tionnaire returns are shown in Table 1. It can 
be seen that returns were obtained from all 
geographical areas, although there were fewer 
returns from the southeastern section of the 
country. It seems clear, however, that a fairly 
representative sample has been obtained and 
that the conclusions based upon the respon- 
dents’ information will have acceptable gener- 
ality. 

Of the total returns from 60 institutions, 
15 institutions are not included in the analysis. 
Thirteen of these do not employ a full-time 
psychologist, one has not yet been activated, 
and one institution of fewer than one hundred 
patients is an atypical specialized home. The
following analysis, then, is based upon 45 state
institutions, each of which employs at least one
full-time clinical psychologist; together they
have an aggregate patient population of 88,000.


An Overview 


A brief reading of the questionnaire returns
indicates clearly that psychological activities are
in a state of change. From almost every insti- 
tution the reports indicate that programs are 
being modified, that testing procedures are be- 
ing continually improved, that the physical 
features are being enlarged, and that the scope 
of the contribution which the psychologist is 
making to the over-all care of the mental de- 
fective is increasing. A random selection of 
comments from the returns indicates this fact 
quite clearly. “Present budgetary plans for the 
coming year include possible creation of two 
vacancies.” “Plans are being made to start a 
group therapy program.” “Plans in prospect
for increasing amount of time devoted to
research and for establishing a play therapy
program.” “We have recently expanded our
outpatient clinic service.” “Plans are now being
made for group therapy with girls who have 
difficulty in adjusting to institution.” “Money 
has been earmarked for an addition to pres- 
ent quarters.” “Expansion is in the planning 
stage.” “Play therapy room and equipment 
now under construction.” “We expect soon to 
fill the other Senior Assistantship which will 
meet our basic personnel needs from a profes- 
sional standpoint.” Thus, any description of 
current activities must be understood to be an 
artificially static portrayal and should be con- 
sidered within the context of a dynamic chang- 
ing profession. 


Current Activities and the Patient- 
Psychologist Ratio 


The major portion of the psychologist’s activities
is in connection with testing, research,
and therapy, but his other duties cover a wide 
area of administrative responsibility. Men- 
tioned as activities in addition to intramural 
testing, research, and therapy, are cooperation 
in outpatient clinics, training of employees, 
traveling school clinics, job placement both 
within and outside of the institution, recrea- 
tion program direction, attending staff meet- 
ings, interviewing parents of prospective pa- 
tients, teaching at affiliated universities, and 
speaking to lay groups. At at least two insti- 
tutions the psychologist also censors incoming 
mail and is in charge of the medical library. 
A total of 97 per cent of the average psycholo- 
gist’s time, however, is accounted for under 
the broad headings of administrative duties, 
psychological testing, research, and therapy. 

Of the group of 45 institutions whose ques- 
tionnaire returns are analyzed, 41 are able to 
indicate the approximate proportion of time 
engaged in each of these areas of activity. As 
a group, psychologists’ activities are divided so 
that 12 per cent of the work-week is utilized 
in administrative functions, 60 per cent in 
testing, 10 per cent engaged in research, and 
15 per cent in psychotherapy. 

It seems clear that at the institutions where 
one psychologist is employed for a patient 
population of 2,000 or 2,500, activities must 
be concentrated in the area of psychological
testing. In order to perform the basic function
of a psychologist in such a setting, collateral
activities must be sharply curtailed.
Given the total patient population and the 
number of psychologists employed at a particu- 
lar institution, it is possible to compute a pa- 
tient-psychologist ratio, the magnitude of 
which yields information regarding the sheer 
numerical adequacy with which that particu- 
lar institution is able to offer its psychological 
facilities to its patients. The higher this ratio, 
the more patients there are per available psy- 
chologist. That this continuum probably does 
not coincide with the dimensions of compe- 
tence, value, or contribution should be stated 
explicitly. While it is possible that where there 
is a lower patient-psychologist ratio, there is a 
more competent psychology department, the 
present data offer no way of testing such a hy- 
pothesis. Examination of the distribution of 
professional activities as a function of patient- 
psychologist ratio may, however, yield infor- 
mation indicating how departmental activities 
differ with respect to this function, in what 
ways additional personnel are utilized when 
the ratio decreases, and hence which activities 
are considered to require additional time. It 
would seem possible, for example, that the cre- 
ation of a vacancy and the acquiring of a new 
psychologist, with the resultant decrease of 
patient-psychologist ratio, would make it pos- 
sible to devote additional time to testing, re- 
search, or therapy, or to any combination of 
these. If in examining the data, consistent 
trends are observed, it will seem a sound de- 
duction that the areas which show the increase 
are the ones where greater emphasis is needed. 
In the present data, the mean number of
patients per psychologist is 882, with a range of
201 to 3500.


Table 2 indicates the distribution of professional
activities for the 17 institutions whose
patient-psychologist ratio is higher than the
mean (more patients per psychologist) as compared
with the 24 institutions whose patient-psychologist
ratio is below the mean (fewer
patients per psychologist). Significant differences
are readily apparent. It is only in the
proportion of time in administrative activities
that there is no difference between the two
groups. The institutions with the lower
patient-psychologist ratio spend significantly less
time in psychological testing, and significantly
greater time in research and therapy. That
these differences are statistically significant
despite the small number of cases in each
group indicates the directions in which
psychological activities are being modified as their
facilities become numerically more adequate
and seems to parallel the general trend of
psychological activities at all types of installations,
namely, greater emphasis upon research
and therapy.
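
The t ratios reported in Table 2 can be checked from the printed means and standard deviations. The sketch below is only a check under stated assumptions: it takes the group sizes given above (17 high-ratio and 24 low-ratio institutions) and assumes the standard error of the difference was computed as sqrt(SD1^2/(n1 - 1) + SD2^2/(n2 - 1)). The report does not state its formula; this form is chosen because it reproduces the printed standard errors to within rounding. The label "administration" for the first pair of columns is inferred from the text, and all names in the code are illustrative.

```python
from math import sqrt

N_HIGH, N_LOW = 17, 24  # institutions above / below the mean ratio

# activity: (mean_high, sd_high, mean_low, sd_low), as printed in Table 2.
# "administration" is an inferred label for the first pair of columns.
TABLE_2 = {
    "administration": (13.88, 12.51, 11.92, 7.75),
    "testing": (67.24, 19.95, 55.29, 15.78),
    "research": (6.06, 6.71, 12.04, 8.46),
    "therapy": (8.29, 8.46, 16.54, 15.62),
}


def se_and_t(mean_high, sd_high, mean_low, sd_low):
    """Return (standard error of the difference, t ratio).

    Assumed formula: each variance is divided by n - 1 before summing.
    """
    se = sqrt(sd_high**2 / (N_HIGH - 1) + sd_low**2 / (N_LOW - 1))
    t = abs(mean_high - mean_low) / se
    return se, t


results = {name: se_and_t(*row) for name, row in TABLE_2.items()}
for name, (se, t) in results.items():
    print(f"{name:15s} S.E. diff = {se:.2f}  t = {t:.2f}")
```

Computed this way, the standard errors and t ratios agree with the printed values to within about 0.01, which suggests, but does not establish, that this was the formula used.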


Psychological Testing 

One hundred and twenty different test in 
struments are listed as being utilized by the 
responding institutions. Only 63 of these are 
mentioned by more than one institution and 
hence a high degree of variability and indi 
viduality seems an important characteristic of 
departments of psychology throughout the 
country. To the extent that this reflects an at- 
tempt to utilize all test instruments for their 
possible contribution to the better understand- 
ing of an individual patient, it would seem to 
represent a positive sign. 

Tables 3, 4, and 5 present a list of those 
test instruments mentioned by three or more 
institutions. Following each test will be found 


Table 2
Percentage of Time Engaged in Professional Activities as a Function
of Patient-Psychologist Ratio

Activity      Administration      Testing         Research        Therapy

Ratio           High    Low      High    Low     High    Low     High    Low
Mean           13.88  11.92     67.24  55.29     6.06  12.04     8.29  16.54
SD             12.51   7.75     19.95  15.78     6.71   8.46     8.46  15.62
Diff.               1.96           11.95            5.98            8.25
S.E. Diff.          3.52            5.98            2.43            3.88
t                   0.55            2.00            2.46            2.13
p                not sig.           0.05            0.02            0.05











Table 3
Use of Psychological Test Instruments: Intelligence

Test Instrument                                    Freq.    %

Revised Stanford-Binet (Form L)                      43    96
Wechsler-Bellevue (Form I)                           42    93
Revised Stanford-Binet (Form M)                      28    62
Wechsler Intelligence Scale for Children             27    60
Kuhlmann Tests of Mental Development                 23    51
Goodenough Drawing Test                              23    51
Wechsler-Bellevue (Form II)                          21    47
Arthur Point Scale (Form I)                          20    44
Cattell Infant Intelligence Scale                    16    36
Merrill-Palmer                                       14    31
Gesell Developmental Schedules                       14    31
Cornell-Coxe Performance Ability Scale               13    29
Hayes Revision of the Stanford-Binet (Blind)         10    22
Arthur Point Scale (Form II)                          7    16
Nebraska Test of Learning Aptitudes (Deaf)            7    16
Porteus Mazes                                         6    13
Kent E-G-Y                                            6    13
Pintner-Paterson                                      6    13
Minnesota Pre-School                                  6    13
Stanford-Binet (1916)                                 5    11
Chicago Non-Verbal                                    4     9
Kohs Blocks                                           3     7
Wechsler Memory Scale                                 3     7
Healy Picture Completion                              3     7
Leiter International Performance Scale                3     7





Table 4
Use of Psychological Test Instruments:
Character and Personality

Test Instrument                                    Freq.    %

Rorschach                                            37    82
Thematic Apperception Test                           32    71
Vineland Social Maturity Scale                       25    56
Bender Visual Motor Gestalt                          21    47
Machover Draw-A-Person                               12    27
House-Tree-Person                                    10    22
Minnesota Multiphasic Personality Inventory          10    22
Szondi                                                9    20
Children’s Apperception Test                          6    13
Make-A-Picture-Story                                  5    11
California Test of Personality                        5    11
Symonds Picture-Story                                 4     9
Rogers Test of Personality                            4     9
Goldstein-Scheerer Cube Test                          3     7





the frequency and percentage of all institutions 
reporting use of the test. The tests are cate- 
gorized into three types following the clas- 
sification used by Buros [1]. Although in 
the present questionnaire tests were listed un- 
der three groups, i.e., intelligence and devel- 


Table 5
Use of Psychological Test Instruments:
Achievement and Aptitude

Test Instrument                                    Freq.    %

Stanford Achievement Tests                            9    20
Wide Range Achievement Tests                          8    18
Gray Oral Reading Test                                7    16
Metropolitan Achievement Tests                        5    11
Progressive Achievement Tests                         3     7
Werner Marble Board                                   3     7
Oseretsky Test of Motor Proficiency                   3     7


opmental, personality, and other, it is not pos- 
sible to use this grouping in reporting the 
findings. Many tests are listed in more than 
one category depending on the use made of 
the test at different institutions. The most out- 
standing example is the reported use of the 
Bender Gestalt. It is mentioned by 21 institutions,
3 times as a test of intelligence, 8
times as a test of personality, and by 10 in- 
stitutions as falling in neither of these cate- 
gories. 

Table 6 indicates the most frequently uti- 
lized tests of intelligence and personality at 








Table 6
Most Frequently Used Psychological Tests

Test Instrument                                    Freq.    %

Intelligence
  Revised Stanford-Binet (L or M)                    31    69
  Wechsler-Bellevue (I or II)                         9    20
  Stanford-Binet (1916)                               4     9
  Kuhlmann Tests of Mental Development                1     2
Personality
  Rorschach
  Thematic Apperception Test
  Machover Draw-A-Person
  MMPI
  Vineland Social Maturity Scale
  House-Tree-Person
  Lowenfeld Mosaic
  California Test of Personality
  Szondi
  Rogers Test of Personality





each institution. The Revised Stanford-Binet 
and the Rorschach, it can be seen, are by far 
the most commonly used instruments. The 
1916 Stanford-Binet, mentioned by five insti- 
tutions as part of their test battery, is used 
more frequently than any other test at four 
of these institutions. Two institutions report 
no personality tests in use. Again the wide 
variability, especially in the area of personality 
testing, can be seen. 

Walker [2] reported a similar list derived
in a somewhat different manner from a similar
sample of institutions. Comparison of the
present list with that obtained by Walker
indicates that in the less than two years
separating the two questionnaires, the use of the
Wechsler Intelligence Scale for Children has
increased fivefold. In the area of personality
testing, the Machover Draw-A-Person Test
is now mentioned six times as frequently, the
H-T-P three times as often, and the MMPI
twice as often.


Current Needs


Of the total sample of 45 institutions, 34 
(76 per cent) express some needs in order that 
their departments of psychology might operate 
more effectively. This figure compares favor- 
ably with Walker’s finding that “ . . . 72 per 
cent of the institutions feel that their facilities 
in the psychological field are inadequate” [2, p. 
428]. Twenty of the 34 institutions require
professional personnel. Thirteen need one
additional psychologist; seven require two
additional psychologists. Nine institutions express
a desire for interns and seven indicate a
shortage of stenographic assistance.


Table 7 
Equipment Needs of Departments of Psychology 

Need Freq. % 
Testing equipment 14 41 
Play therapy room and equipment 12 35 
Office and testing space 12 35 
Library facilities 7 21 
Tape or wire recorder 6 18 
One-way vision mirror 4 12 
Audiometer 3 9 
Telebinocular 2 6 
Intercommunication system 2 6 
Calculating machine 1 3 


In Table 7 are indicated the nature and ex- 
tent of needs for additional equipment. It can 
be seen that the most frequent needs are for 
more testing equipment, space and equipment 
for play therapy, and expansion of office and 
testing facilities. 

















Table 8
Potential Extension of Services

Extension                                          Freq.    %

Individual psychotherapy and counseling              25    74
Research                                             15    44
More adequate testing of patients                     8    24
Rehabilitation                                        7    21
Outpatient clinic                                     6    18
Group psychotherapy                                   5    15
Retesting of patients                                 4    12
Training of staff                                     4    12
Intern-training program                               3     9
Job placement within institution                      3     9
Personnel testing                                     1     3





On the basis of the fulfillment of these per- 
sonnel and equipment needs, the responding in- 
stitutions indicate that their current services 
could be expanded considerably. Table 8 indi- 
cates the nature and extent of these additional 
contributions. Most affected by the present 
shortages is the amount of time devoted to in- 
dividual psychotherapy and counseling, and to 
research. Almost three-fourths of the depart- 
ments would be able to expand their psycho- 
therapeutic program, and nearly one-half of 
the institutions would be able to augment 
their research programs. It should be pointed
out again that some of these expressed needs
are in the process of being met, and some of
these potential services will be put into
operation in the near future.


Discussion

From two independent sources in the present
data, the direction of changes in psychological
activities at institutions for mental defectives
becomes clear. These changes are toward
greater emphasis upon research and
psychotherapy, and a decrease in emphasis upon
testing exclusively. First, when institutions
with fewer patients per psychologist are
compared with institutions with more patients per
psychologist, the former departments of
psychology devote significantly more time to
research and therapy and significantly less
time to testing. Psychological examination
still remains the major contribution of the
psychologist, however. Second, expressed
changes in activities as a function of the
meeting of current needs are in the direction of
greater emphasis upon research and therapy.
Despite this apparent unanimity of purpose,
present activities remain highly individualized.

Of the large number of tests currently 
utilized at each institution for the evaluation 
of the intellectual level, personality structure, 
and aptitude and achievement constellations, 
only half are mentioned by more than one 
institution. While this finding is partially due 
to the varying degrees of completeness with 
which institutions listed their currently used 
instruments, it is apparent that there is but 
little agreement as to the most effective tools 
at hand. It is probably also true that individ- 
ual psychologists are able to obtain differing 
amounts of information from a particular test 
instrument depending upon their own experi- 
ence and training. This factor may help ex- 
plain the marked individuality in testing 
practice. The clinical psychologist has literally 
thousands of psychological test instruments to 
choose from, and that there is so little simi- 
larity from institution to institution regarding 
the instruments chosen for use emphasizes the 
need for research designed to evaluate the 
relative merits of many of these instruments 
whose functions in the clinical armamentarium 
overlap. 

Current needs of departments of psychology
do not seem unrealistic or unattainable. Yet
with these needs met, a vast increase in the
services which psychologists offer their patients
is anticipated. A similar study five years from
now would yield information indicating the
actual nature of the changes in activities as a
function of those current needs actually met.


Summary 


In an attempt to achieve insight into the
activities and needs of departments of
psychology at institutions for mental defectives, a
comprehensive questionnaire was submitted to
79 state institutions caring for mental
defectives, epileptics, or both. Returns were
obtained from 60 of these institutions. The
major findings may be summarized as follows:

1. Twenty-two per cent of the responding
institutions are without the services of a full- 
time clinical psychologist. 

2. There is great individuality in the pre- 
sent choice of test instruments with a trend 
toward more frequent use of recently developed
tests of intelligence and personality.

3. There is marked agreement regarding 
the direction of future changes in psychol- 
ogical activities, namely, greater emphasis up- 
on research and psychotherapy. 

4. The present shortage of professional per- 
sonnel in departments of psychology at institu- 
tions for mental defectives would be alleviated 
by the employment of a minimum of twenty- 
seven additional psychologists throughout the 
country. This figure represents the explicitly 
stated needs of the institutions whose responses 
are analysed. In addition, thirteen responding 
institutions are without the services of a full- 
time psychologist and their needs should be ad- 
ded to this figure. 
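
The counts behind findings 1 and 4 can be re-derived from figures given earlier in the report: 13 of the 60 responding institutions employ no full-time psychologist, 13 institutions need one additional psychologist, and 7 need two. The short Python check below is illustrative arithmetic only; the variable names are not from the report.

```python
responding = 60            # institutions returning the questionnaire
without_psychologist = 13  # of these, no full-time psychologist
need_one = 13              # institutions needing one more psychologist
need_two = 7               # institutions needing two more psychologists

# Finding 1: share of respondents without a full-time psychologist.
pct_without = round(100 * without_psychologist / responding)

# Finding 4: minimum number of additional psychologists explicitly requested.
additional_needed = need_one * 1 + need_two * 2

print(pct_without, additional_needed)
```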

Received September 12, 1951. 


References 

1. Buros, O. K. The third mental measurements
yearbook. New Brunswick, N. J.: Rutgers
Univer. Press, 1949.

2. Walker, G. H. Report of committee on
administration survey of institutional standards.
Amer. J. ment. Def., 1951, 55, 422-446.

3. Mental Health Statistics. Current reports.
Federal Security Agency, Public Health Service,
National Institute of Mental Health. Dec.,
1950. Series IMH-BS1, No. 1.

4. Officers, Fellows, and members of the American
Association on Mental Deficiency. Mansfield
Depot, Conn., 1950.











Patients Diagnosed Manic Depressive Psychosis — Manic State¹


J. R. Wittenborn and Walter Weiss 


Yale University 


The present study seeks to explore and 
classify symptomatic similarities and differ- 
ences among a group of patients who have a 
common diagnosis. It is possible that conven- 
tional psychiatric diagnoses may have less de- 
scriptive value than a diagnostic procedure 
which evaluates the patient on the basis of the 
degree to which his symptom manifestations 
resemble each of several symptom clusters [3]. 

The group comprises twenty newly admitted 
patients at the Connecticut State Hospital; 
all have a diagnosis of Manic Depressive 
Psychosis—Manic State. A strictly consec- 
utive basis for selecting the patients was mod- 
ified sufficiently to secure the advantage of an 
equal number of men and women. The over-all 
symptom similarities and differences among the 
patients were classified by determining the de- 
gree to which they resembled each other with 
respect to their ratings on a set of 55 symptom- 
rating scales. The symptom ratings were made 
during the first few weeks of the patient’s 
hospitalization and were, in every case, com- 
pleted before the final diagnosis was made. 
The final diagnosis was the result of a staff 
evaluation and included the contributions of 
psychiatrists, psychologists, internists, and 
social workers. 

¹The support of Dr. Mark A. May, Director of
the Institute of Human Relations, Yale University,
is gratefully acknowledged. The writers also wish
to acknowledge the cooperation of the staff of the
Connecticut State Hospital. This report is based on
a student project in the writer’s class in statistics.

The present technique for classifying the
patients involves correlating each patient with
every other patient for the sample of 55
symptom-rating scales and then determining the
similarity of symptom patterns among the
patients by means of Thurstone’s procedure
for multiple factor analysis. The intercorrelations
for the patients are shown in Table 1.
If the diagnosis, Manic Depressive Psycho- 
sis— Manic State, were descriptively sufficient, 
we should expect to find all of the patients 
clustering together to form one conspicuous 
general factor. If, however, there is no con- 
spicuous general factor and there are impor- 
tant group factors, it would be concluded that 
the differences among the patients (e.g., 
differences in prevalence of paranoid symptoms)
are approximately as great as the similarities
(e.g., similarities on “manic” symptom
rating scales), and that for the samples of 
patients such as the present one, the diagnostic 
grouping Manic Depressive Psychosis— 
Manic State should be abandoned in favor of 
some descriptive procedure which recognizes 
important differences among patients. 
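
The first step of the technique described above, correlating each patient with every other patient across the symptom-rating scales, can be sketched in Python as follows. This is a minimal illustration, not the authors' computation: the ratings are invented, only five scales and three patients are used in place of the study's 55 scales and 20 patients, and the function names are hypothetical. The subsequent factor analysis is not shown.

```python
from math import sqrt


def pearson(x, y):
    """Product-moment correlation between two equal-length rating profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def patient_intercorrelations(ratings):
    """Correlate every pair of patients over the same set of scales.

    `ratings` maps a patient number to that patient's list of scale ratings.
    Returns {(patient_a, patient_b): r} for each pair with patient_a < patient_b.
    """
    ids = sorted(ratings)
    return {
        (a, b): pearson(ratings[a], ratings[b])
        for i, a in enumerate(ids)
        for b in ids[i + 1:]
    }


# Hypothetical ratings on five scales (illustrative values only).
example_ratings = {
    1: [3, 2, 0, 1, 3],
    2: [3, 3, 0, 1, 2],
    3: [0, 1, 3, 3, 1],
}
correlations = patient_intercorrelations(example_ratings)
for pair, r in correlations.items():
    print(pair, round(r, 2))
```

In this toy example patients 1 and 2 have similar profiles and correlate positively, while patient 3 correlates negatively with both. A matrix of such coefficients is what the factor analysis then takes as its input.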

When there is no important general factor, 
there are two respects in which symptom 
similarities among patients may be revealed by 
factor analysis: 

A. A given rotated factor may indicate the 
presence of certain patients who are sympto- 
matically similar to each other. When practi- 
cally all of the common factor variance of 
certain patients may be accounted for in terms 
of one particular factor, these patients could 
be considered as relatively “pure cases” of 
whatever symptom pattern the rotated factor 
represents. A study of the patient’s symptom 
similarities would be necessary for ascribing 
any meaning to the factor in question. 


B. Some of the patients may be factorially 
complex in the sense that their common factor 
variance is divided among several factors. 
These patients would, of course, be of little 
value in ascribing meaning to any one of the 
factors. It would be supposed, however, that 
similar factor patterns among factorially com- 


Table 1
Intercorrelations Between 20 Patients Diagnosed Manic Depressive Psychosis—Manic State
(Correlations Based on a Sample of 55 Symptom Rating Scales)

[The 20 x 20 matrix of correlations is not legible in this copy.]














[Figure 1. Profile chart of cluster score by symptom cluster.]

Symptom clusters: 1. Acute Anxiety; 2. Conversion Hysteria;
3. Manic State; 4. Depressed State; 5. Schizophrenic Excitement;
6. Paranoid Condition; 7. Paranoid Schizophrenic;
8. Hebephrenic Schizophrenic; 9. Phobic Compulsive.

Legend for profiles: female patients nos. 10 and 17, with common
factor variance primarily in Factor I; male patients nos. 3 and 6,
with common factor variance primarily in Factor V.

Fig. 1. Symptom cluster profiles for four patients, illustrating the
profile similarities of manic patients with similar factorial
compositions.


plex patients are due to similar symptom pat- 
terns. 

Regardless of the manner in which the 
symptom similarities among patients are re- 
vealed by a factor analysis of their inter- 
correlations, the absence of a conspicuous 
general factor among patients who have a 
common diagnosis or the presence of impor- 
tant group factors among such patients is a 
clear indication that from the symptomatic 
standpoint, at least, the diagnostic grouping 
is descriptively inadequate (i. e., the diagno- 
sis may refer to no similarity among the 
patients or the similarity among them may be 
obscured by differences). If, however, simi- 
larities and differences are revealed among 
patients by factor analysis, and if these 
differences are nicely related to similarities 
and differences provided by a standard multi- 
ple diagnostic procedure, one could justifiably 
claim that the multiple diagnostic procedure 
is superior to the simple diagnostic grouping
of psychiatrists. The multiple diagnostic procedure
which the present investigation seeks to
evaluate is quantified in nature and is based 
upon a weighted scoring of 55 symptom-rating 
scales. As a result of earlier studies of cluster- 
ing tendencies among the symptoms of mental 
hospital patients [4, 5], nine symptom clusters 
were observed to exist. The nature of these 
symptom clusters and the degree to which each 
of the 55 symptom-rating scales contributes to 
each cluster was determined by applications of 
multiple factor analysis. 
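
The weighted scoring described above may be pictured with a minimal sketch. The ratings and weights below are hypothetical and use five scales rather than the 55 of the actual procedure; the published weights are not reproduced here, and the function name `cluster_score` is introduced only for illustration.

```python
def cluster_score(ratings, weights):
    """Weighted sum of symptom-scale ratings for one symptom cluster.

    ratings: one rating per symptom scale for a single patient.
    weights: the contribution of each scale to the cluster
             (hypothetical values, not the published weights).
    """
    if len(ratings) != len(weights):
        raise ValueError("ratings and weights must have the same length")
    return sum(r * w for r, w in zip(ratings, weights))


# Hypothetical patient rated on five scales (the real procedure uses 55).
ratings = [2, 0, 3, 1, 0]
# Hypothetical weights for a single cluster; a zero weight means the
# scale does not contribute to that cluster.
manic_weights = [1, 0, 2, 0, 0]

print(cluster_score(ratings, manic_weights))
```

Repeating the computation with a separate weight vector for each of the nine clusters would yield the kind of nine-score profile shown in Figure 1.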

Scores for each of these nine symptom 
clusters have been determined for the sample 
of twenty patients who provide the data for 
the present analysis. Figure 1 illustrates the 
quantified multiple nature of this new diagnos- 
tic procedure and, incidentally, shows the 
similarity in diagnostic profiles for patients 
who have similar patterns of factor loadings. 

If as a result of the present study it can be
shown that the patients with similar factor
patterns have similar quantified multiple
diagnostic profiles, it may be concluded that the
similarity among the patients with respect to
symptom pattern is reflected by similarity
among the patients with respect to symptom
cluster scores, i. e., it may be concluded that
the symptom-cluster scores are sufficient to
reveal similarities and differences in the 55
symptom-scale ratings for the present group
of patients. If it were found that patients who
have similar symptom patterns (as revealed by
similar patterns of factor loading) were not
similar to each other with respect to symptom
cluster scores, it would be concluded that
scores based on the symptom clusters originally
determined for heterogeneous groups of patients
are not sufficient for the purpose of describing
the symptomatic similarities and dissimilarities
among patients falling in one diagnostic group,
i. e., Manic Depressive Psychosis-Manic State.

Although such a discrepancy would not
obviate the apparent descriptive value of the
quantified multiple diagnostic procedure, such
discrepancies would indicate a practical
descriptive limitation for the quantified multiple
diagnostic procedure.

J. R. Wittenborn and Walter Weiss

The Analysis

Table 2 shows the rotated factor matrix.
This matrix is a result of the conventional
rotational procedure which seeks to maximize
the number of zero loadings and minimize the
number of positive loadings. An orthogonal
structure was maintained both because it was
feared that an oblique rotational procedure
might obscure an important general factor,¹
and because an oblique structure could not well
illustrate maximal differences among the
patients.

¹ The rotational procedures sought neither to
maximize nor to minimize the apparent importance
of a general factor. Comparisons between the first
centroid factor loadings and the respective
communalities did not suggest the existence of an
important general factor which would be obscured by
the rotational procedure. Doubtlessly some
procedures could show a general factor but any
suspicion that a general symptomatic similarity exists
among the patients may be dispelled by a scrutiny
of Table 1.


Table 2
Rotated Factor Loadings for 20 Patients Diagnosed
Manic Depressive Psychosis-Manic State

Patient
No.   Sex      I     II    III     IV      V     VI     h²
 1     F     .498    …    .100   .136   .325   .434   .884
 2     F    -.082    …    .131   .294   .428   .480   .677
 3     M     .072    …   -.025   .232   .726   .221   .755
 4     F     .580    …    .036  -.002   .069  -.033   .629
 5     M     .585    …    .027   .034   .553   .263   .742
 6     M     .145    …    .368   .204   .653   .029   .642
 7     M     .396    …    .011   .285   .497   .119   .581
 8     M     .248    …    .269   .217   .318   .692   .907
 9     …     .118    …    .091  -.007   .403   .761   .884
10     F     .773    …   -.148   .324   .188   .216   .872
11     F     .032    …    .280   .558   .147   .095   .433
12     …    -.108    …   -.012   .732   .113   .420   .740
13     F     .458    …    .500   .008  -.131   .037   .332
14     …     .33…    …    .015   .105   .347   .638   .654
15     …     .083    …    .005   .461   .101   .774   .857
16     …     .146    …   -.037   .005   .219   .060   .603
17     F     .631    …    .046   .075   .210   .262   .560
18     M     .149    …    .107  -.031   .089   .026   .615
19     M    -.097    …   -.394   .110   .098   .584   .531
20     M     .158    …    .047   .189   .511   .424   .752

Mean of Squared
Factor Loadings*
Men          .660   1.…   .072   .272  2.096  1.717
Women       1.891    …    .156  1.286   .521  1.725

Entries for Factor II as printed (row alignment lost in this copy):
.118, .064, .169, .797, .203, .756, .059, .498, .852, .915

*Signs of factor loadings were retained in determining the means.
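
The relation between the factor loadings in a row of Table 2 and the communality h² in its last column can be sketched as follows. For an orthogonal solution such as the one maintained here, h² is the sum of the squared loadings, and each squared loading is the portion of a patient's variance ascribable to that factor. The loadings below are illustrative values only, not a row of the table, and the function names are introduced for this sketch.

```python
def communality(loadings):
    """Sum of squared loadings over orthogonal factors (h squared)."""
    return sum(x * x for x in loadings)


def share_of_common_variance(loadings):
    """Fraction of the common variance ascribable to each factor."""
    h2 = communality(loadings)
    return [x * x / h2 for x in loadings]


# Illustrative loadings on six orthogonal factors (not taken from Table 2).
loadings = [0.6, 0.0, 0.1, -0.2, 0.5, 0.3]

h2 = communality(loadings)
shares = share_of_common_variance(loadings)
largest = max(range(len(shares)), key=lambda i: shares[i])

print(round(h2, 3))   # communality for this illustrative patient
print(largest + 1)    # factor carrying the largest share of common variance
```

A patient described as having common factor variance "primarily in Factor I" is, in these terms, one whose largest share falls on the first factor.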







The individuals who determine each of the 
various factors are of considerable intrinsic in- 
terest because they may be an indication of the 
nature of some of the principal varieties of pa- 
tients who may be diagnosed Manic Depressive 
Psychosis—Manic State. Figure 1 shows the 
quantified multiple psychiatric diagnostic pro- 
files for individuals who most determine the 
nature of two of the factors. From Figure 1 it 
may be seen that each of the factors represents 
the presence of individuals whose symptom pat- 
terns are highly similar to each other and con- 
spicuously different from the symptom pat- 
tern of individuals who determine the other 
factor. One of the most interesting aspects of 
this classification of patients is the possibility 
that some of these symptomatically homogeneous
subgroups may be determined by the
sex of the patient. This is perhaps most clearly 
indicated in Table 2 which, for each factor, 
gives separately the mean of the factor load- 
ings squared for the women and the mean of 
the factor loadings squared for the men. (In 
summing the squared factor loadings the orig- 
inal signs are retained. A squared factor load- 
ing indicates the portion of total variance of a 
patient’s symptoms which is ascribable to the 
factor in question.) The differences between 
the sexes are significant at the 1 per cent level 
for Factors I, II, IV, and V. The similarities 
within the two groups of individuals who form 
Factors I and IV may be primarily due to fe- 
maleness whereas the similarities within the 
two groups of individuals who form Factors 
II and V may be primarily due to maleness.
Inasmuch as no hypothesis is offered whereby 
sex per se could determine symptom patterns, 
it is preferable to think of the sex determina- 
tion of symptom patterns in a statistical sense 
only. 
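
The comparison of the sexes described above rests on the mean of the squared factor loadings, with the original sign retained, computed separately for the women and the men on each factor. A minimal sketch of that computation for one factor follows. The loadings and sexes are hypothetical, and the helper names are assumptions introduced for illustration; the sketch does not reproduce the significance test, which the text does not specify.

```python
import math


def signed_square(x):
    """Square a loading but keep its original sign."""
    return math.copysign(x * x, x)


def mean_signed_squares_by_sex(loadings, sexes):
    """Mean signed squared loading on one factor, separately by sex."""
    groups = {}
    for x, s in zip(loadings, sexes):
        groups.setdefault(s, []).append(signed_square(x))
    return {s: sum(v) / len(v) for s, v in groups.items()}


# Hypothetical loadings of four patients on a single factor.
loadings = [0.5, -0.1, 0.7, 0.2]
sexes = ["F", "M", "F", "M"]

means = mean_signed_squares_by_sex(loadings, sexes)
print({s: round(m, 3) for s, m in means.items()})
```

A factor on which the mean for one sex is large and the mean for the other is near zero is, in the statistical sense intended above, determined by that sex.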

Inasmuch as the mean ages of the men and 
the women patients were approximately uni- 
form, it can be claimed that the differences 
here ascribed to sex are not in reality ascribable 
to the age of the patient. 

The nine symptom cluster scores for the 
quantified multiple psychiatric diagnostic pro- 
cedure were examined for each of the twenty 
patients comprising the present sample. All the 
patients had a relatively high score on the manic
cluster and this similarity was probably
responsible for their common diagnosis.
Nevertheless, this similarity was not sufficiently
marked or consistent within the group to result 
in uniform positive intercorrelations among the 
patients, and the fact that the factor analysis of 
manic patients revealed six groups of patients 
attests to the diversity of symptom patterns 
within the manic group. 

Factor I is for the most part determined by 
women who are characterized by highest scores 
on the manic state cluster, the schizophrenic 
excitement cluster, and the hebephrenic or 
deteriorated schizophrenia cluster. They also
have conspicuously high scores on the clusters 
for paranoid schizophrenia and depression. 
The cluster scores for these women are quite 
different from the cluster scores for the women 
who are primarily responsible for Factor IV. 
In general the Factor IV patients have much 
lower ratings on all the symptom-cluster scores 
than those who comprise Factor I. The highest
cluster scores for the Factor IV patients are 
for the manic state cluster and the schizo- 
phrenic excitement cluster, and their cluster 
scores for depression and for hebephrenic or 
deteriorated schizophrenia are minimal. 

The men who comprise Factor II have their 
highest symptom-cluster score for the manic 
state and they have relatively high scores for 
the paranoid condition and the schizophrenic 
excitement clusters. Their scores are minimal 
for all other clusters. 

The men who are primarily responsible for 
Factor V have extraordinarily high scores for 
the paranoid schizophrenic, the manic state, and 
the paranoid condition clusters. They also have 
conspicuous ratings for the anxiety cluster and 
the hysterical cluster. 

Factor VI seems not to be determined pri- 
marily by one sex alone; the individuals who 
comprise it have relatively high ratings for the 
paranoid schizophrenia cluster, the manic state 
cluster, and the paranoid condition cluster. 
The ratings for the other clusters are negligi- 
ble. 

Summary and Conclusions 

In the present analysis a sample of twenty 
patients diagnosed Manic Depressive Psycho- 
sis—Manic State provides the basis for an 
evaluation of the descriptive efficacy of the 
psychiatric diagnosis in question. The procedure
involves a factor analysis of the intercorrelations
among the patients. The correlations
between patients are determined by
similarity between patients with respect to a 
standard set of 55 symptom-rating scales. 
Within the limits of the present sample of 
patients and symptom-rating scales the following
conclusions are offered:


1. Patients with the diagnosis of Manic 
Depressive Psychosis—Manic State tend to 
differ from each other in specifiable sympto- 
matic respects. 

2. There is evidence that among patients 
with the diagnosis of Manic Depressive Psy- 
chosis—Manic State sex differences are asso- 
ciated with important symptom differences. 

3. The symptomatic similarities and differ- 
ences among patients bearing the diagnosis 
Manic Depressive Psychosis—Manic State are 
revealed by a new descriptive procedure which, 
in both the present and in earlier studies [3, 
4, 5, 6, 7], has been referred to as a quantified 
multiple psychiatric diagnosis. It may be 
claimed for the present sample of patients, at 
least, that important similarities and differences
among patients revealed by the new procedure
are obscured by the psychiatrist’s
conventional diagnosis.


Received August 3, 1951. 


References 


1. English, O. S. Observations of trends in manic
depressive psychosis. Psychiatry, 1949, 12, 125-
134.

2. Good, R. Depression. Brit. J. Med. Psychol.,
1945-46, 20, 344-375.

3. Wittenborn, J. R. A new procedure for evaluating
mental hospital patients. J. consult. Psychol.,
1950, 14, 500-501.

4. Wittenborn, J. R. Symptom patterns in a group
of mental hospital patients. J. consult. Psychol.,
1951, 15, 290-302.

5. Wittenborn, J. R., & Holzberg, J. D. The generality
of psychiatric syndromes. J. consult.
Psychol., 1951, 15, 372-380.

6. Wittenborn, J. R., & Holzberg, J. D. The Rorschach
and descriptive diagnosis. J. consult.
Psychol., 1951, 15, 460-463.

7. Wittenborn, J. R., & Holzberg, J. D. The
Wechsler-Bellevue and descriptive diagnosis. J.
consult. Psychol., 1951, 15, 325-329.














Some Psychological Aspects of Dermatosis 


John C. Narciso, Jr. 
Chico State College 


Can a disease be considered apart from the 
person who suffers from that disease? In the 
laboratory, perhaps so; in the therapeutic sit- 
uation, probably not. 

It is generally agreed by those who have 
written on the subject that there are psycho- 
logical factors involved in certain skin dis- 
eases. The degree of importance assigned to 
the psychological factors in dermatosis, how- 
ever, varies from one author to another. Klaber 
[4] states that in its classical form dermatosis 
is entirely psychological in origin. Others warn 
that although the psychological factors may be 
of importance, they should not overshadow the 
observation and realization of other important 
factors. 


Van de Erve [8] made observations involv- 
ing 80 patients with dermatitis. These patients 
were found to be above average in intelligence, 
alert, hyperactive, and emotionally unstable. 
He concluded that the disease gave rise to the 
emotional instability rather than that the in- 
stability itself was a fundamental cause of the 
disease. 

Klaber and Wittkower [5] conducted an in- 
vestigation involving 50 dermatosis patients. 
Although they were unable definitely to fix an 
origin for the disease, aggravation and exa- 
cerbation were commonly attributed to worry 
and anxiety. Abnormal fears and anxieties 
were traced to childhood. These authors reported
three psychopathological patterns repeatedly
observed as: (a) repressed sexual excitement;
(b) repressed aggression; and (c)
repressed need for attention.

Dunkel [2] studied 45 neurodermatitis pa- 
tients and found three related trends in the 
accompanying emotional factors, namely: (a) 
difficulty in establishing useful relationships; 
(b) general craving for attention; and (c)
hostility and thinly disguised depression. 

Montgomery [6], in a psychoanalytic study 


of a case of acne vulgaris, lists guilt, anxiety, 
narcissism, and mental and emotional conflict 
regarding sexual promiscuity as the basic mani- 
festations. 

An interesting observation that is pertinent 
here was made by Obermayer [7] when he 
suggested that psychogenic factors in derma- 
tosis are much more widely accepted in prac- 
tice than in theory by medical practitioners. 
To add some support to this, perhaps, is the 
report by Abramowitz [1] in which the helpful
use of placebos along with catharsis in
dermatosis cases was described.


In reading the total literature, one can def- 
initely sense the feeling of agreement among 
observers, with individual dermatologists well 
represented, that psychological factors are in- 
volved in dermatosis. A next desirable step 
would be to attempt to demonstrate quanti- 
tatively, with adequate controls, the presence 
of psychological deviations, and beyond that 
to focus upon the particular pattern or patterns
of deviation if deviation exists. This
author has found no study in the literature 
dealing quantitatively with this problem, nor 
any study involving standard psychological 
testing devices. 

Although Obermayer [7] mentions it last 
in a graded list of the dermatoses in which 
there is demonstrable emotional disturbance, 
acne was selected as the manifestation of der- 
matosis to study in this investigation inasmuch 
as it is relatively common and more readily 
recognizable than the other forms of derma- 
tosis. To make use of another area of derma- 
tosis would have required medical diagnosis, 
an aid which was not immediately available 
to the author. 

The specific purpose of this study was to 
compare the psychological status, as measured 
by a standard psychological test, of a group 
of college students who exhibited acne of the
face to a similar group of college students not
exhibiting any form of acne.


Subjects 


Twenty students, ten men and ten women,
manifesting acne of the face were selected at 
random by the author from the physical educa- 
tion classes at the University of Texas. This 
was the experimental group. For the control 
group, ten men and ten women free from any 
form of overt acne were selected at random to 
correspond in age, sex, and classification in the 
University with the acne group. The ages 
ranged from 17 to 23 years. The mean age 
was 20 years. 

Procedure 

The psychological test employed was the 
Minnesota Multiphasic Personality Inventory 
(MMPI), group form. Each of the subjects 
selected for both groups was approached indi- 
vidually by the author. They were simply asked 
to participate in a study to be conducted at the 
University. There were some few refusals by 
persons both with and without acne. Each par- 
ticipating subject completed the test at his own 
convenience at the University of Texas Coun- 
seling Bureau. Thus, there were no group sit- 
uations that might have indicated to the acne 
group that they were chosen because of the 
acne. Each of the subjects was told that his
name need not appear upon the test answer 
sheet as the study was a group study. All of 
the subjects, however, did give their names. 


Results 


Even though there was a degree of matching,
the results of the two groups were compared
statistically as independent groups. The
T scores of the MMPI were used in the analysis.

Each of the nine categories of the MMPI 
was studied individually. The results of this 
analysis are indicated in Table 1. The means, 
standard deviations, and mean difference for 
the groups for each category are presented. 
The t test of significance was employed to test
the difference of means. The t for each category
also appears in Table 1.
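
The t test for two independent groups of equal size can be sketched from the summary statistics alone. The article does not state which computing formula was used; the sketch below assumes that each standard deviation was computed with n in the denominator, so that the standard error of the difference is the square root of (SD₁² + SD₂²)/(n − 1). Under that assumption the hysteria-scale figures in Table 1 (control mean 53.30, SD 6.67; acne mean 59.55, SD 9.30; 20 subjects per group) give the printed t of 2.38. The function name is introduced for illustration.

```python
import math


def t_from_summary(mean_a, sd_a, mean_b, sd_b, n):
    """t for two independent groups of equal size n.

    Assumes each SD was computed with n (not n - 1) in its denominator,
    which is an assumption about the article's computing formula.
    """
    standard_error = math.sqrt((sd_a ** 2 + sd_b ** 2) / (n - 1))
    return (mean_a - mean_b) / standard_error


# Hysteria scale: acne group against control group, 20 subjects each.
t_hy = t_from_summary(59.55, 9.30, 53.30, 6.67, 20)
print(round(t_hy, 2))  # 2.38
```

With 38 degrees of freedom a value of this size exceeds the two-tailed 5 per cent criterion of about 2.02 but not the 1 per cent criterion of about 2.71.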

As indicated by the results, the mean differences
of four of the nine categories are statistically
significant. Mean differences for the
hysteria, psychasthenia, and hypochondriasis
scales are all significant at the 5 per cent level
of confidence, while that for the schizophrenia
scale is significant at the 1 per cent level of
confidence.

Table 1
Mean, Standard Deviation, Mean Difference, and
t Score for the Two Groups for Each
Scale of the MMPI

Scale   Group    Mean      SD    Mean Diff.     t
Hs      Cont.   50.90    6.21      6.85         …
        Acne    57.75   12.15
D       Cont.   53.45   10.64       …           …
        Acne    55.60   12.63
Hy      Cont.   53.30    6.67      6.25        2.38
        Acne    59.55    9.30
Pd      Cont.   56.20   10.26       …           .94
        Acne    59.55   11.46
Mf      Cont.   59.26   12.14       …           .24
        Acne    58.30   11.99
Pa      Cont.   52.35    8.53       …           …
        Acne    56.20    9.28
Pt      Cont.   54.25    8.86     10.00        2.24
        Acne    64.25   17.32
Sc      Cont.   53.65    7.79     11.80        3.2…
        Acne    65.45   13.82
Ma      Cont.   56.95    7.50       …           .14
        Acne    56.60    8.24


Discussion 


Although the MMPI may not be accepted 
by some as a valid test of individual diagnosis,
as a group trend indicator it is perhaps
the best of the pencil-and-paper type person- 
ality scales now available. 

The interpretation of the MMPI scales is 
based upon the deviation of scores from a mean 
of 50. In general, a score of 70 or above is 
considered as nonconformity to the norm. 

It may be observed that none of the mean 
scores in this study is 70 or above for either of 
the groups. Thus, the significant differences 
found in Hy, Pt, Hs, and Sc are not indic- 
ative of abnormal states for the acne group as 
such, but rather, they simply indicate a sig- 
nificant trend in the deviation of the acne 
group from the control group. 

Based upon the declared meanings of the 
MMPI categories [3], the following analysis
of the differences derived might be suggested.
The significantly higher score on the Hs scale 
indicates that the acne group tends to have a 
more abnormal concern about bodily functions 
than the control group. Those who score high 
on the Hs scale usually seem to have an im- 
mature approach to adult problems. Common 
organic disease does not lead appreciably to a 
high Hs score. The significantly higher score 
on the Hy scale indicates that the acne group 
tends more than the control group toward re- 
acting to the questions on the MMPI as do 
the conversion type hysterics by whom simu- 
lated organic difficulties are used as adjustment 
mechanisms. The significantly higher score on
the Pt scale indicates that the acne group, more
than the control group, tends to be troubled
by phobias or compulsive behavior. These phobias
may include any type of unreasonable fear
of things or of situations, as well as over-reaction
to more reasonable stimuli. The psychasthenic
symptoms may be manifest in a mild
depression, excessive worry, or lack of self- 
confidence. The significantly higher score on 
the Sc scale indicates that the acne group, more 
than the control group, tends to withdraw 
from reality. 

It is evident that several of these symptoms 
were also reported in the clinical and psycho- 
analytic studies cited from the literature. The 
general emotional instability reported by Van 
de Erve [8], the abnormal fears, worry, and 
anxiety reported by Klaber and Wittkower 
[5] and Montgomery [6] have correlates in 
the symptoms indicated by the MMPI scales 
on which differences were found for these two 
groups. 

From the data presented, it may be con- 
cluded that there is a psychological involve- 
ment associated with acne, for this group, as 
measured by this standard psychological test. 
However, there is nothing concrete to indicate 
whether the condition is an effect or a genetic 
factor of the dermatosis. 

There is no attempt in this preliminary pa- 
per to relate the deviant categories to some un- 
derlying hypothesis. This must wait for a finer 
analysis. However, it appears that the present
study does support those authors who have
hypothesized psychological involvement in
dermatosis, as well as those who have reported
such involvement in clinical or psychoanalytic 
studies. 

Thus, in answer to the original question, it
would appear that research into the genesis
of the dermatoses, and into the therapeutic
methods to be employed, should not proceed
without consideration of the psychological
involvement of the individual in relation to the
overt physiological difficulty.


Summary 


Twenty persons manifesting acne of the face
were compared upon the Minnesota Multiphasic
Personality Inventory to a corresponding
group of twenty persons manifesting no
overt acne. It was found that the acne group 
differed significantly from the control group 
on four of the nine scales of the MMPI, name- 
ly, hypochondriasis, hysteria, psychasthenia, 
and schizophrenia. 


Received July 16, 1951. 


References


1. Abramowitz, E. W. The use of placebos in the
local therapy of skin diseases. N.Y. State J.
Med., 1948, 48, 1927-1930.

2. Dunkel, M. L. Casework help for neurodermatitis
patients. J. soc. Casewk., 1949, 30, 97-103.

3. Hathaway, S. R., & McKinley, J. C. Manual
for the Minnesota Multiphasic Personality Inventory
(Rev. Ed.). New York: Psychological
Corp., 1943.

4. Klaber, R. Psychological factors in the aetiology
of certain skin diseases. Brit. J. Derm.
Syph., 1947, 59, 1-6.

5. Klaber, R., & Wittkower, E. The psychogenesis
of rosacea: a review with special reference to
emotional factors. Brit. J. Derm. Syph., 1939,
51, 501-524.

6. Montgomery, L. Psychoanalysis of a case of
acne vulgaris. Psychoanal. Rev., 1939, 26, 155-
177.

7. Obermayer, M. E. Functional factors in common
dermatoses. J. Amer. med. Ass., 1943,
122, 862-864.

8. Van de Erve, J. M. Functional studies in patients
with neurodermatosis. J. Amer. med. Ass.,
1935, 105, 1098-1103.








Measurable Changes In Empathy With Age 


Rosalind F. Dymond 


The Counseling Center, University of Chicago 


Anne S. Hughes


Garden City, N. Y. 


and Virginia L. Raabe 


Morenci, Mich.


One of the important unanswered questions 
concerning the so-called ability to empathize 
with others, or take their role, is the develop- 
mental one. There are at least four possible 
ways that empathy and age may be related. 

1. The relationship may be direct. As the 
child proceeds in his socialization he increases 
the number and range of his contacts with 
others. Theoretically, he is incorporating more 
and more roles. These incorporated self-other 
patterns allow him to adjust his behavior to 
others by means of taking their roles incipient- 
ly and so modify his behavior to get a desired 
response from the other. This view suggests 
that empathic ability increases as a function 
of age. 

2. A second possibility is that the relation- 
ship may be inverse; that the ability to take 
the role of the other successfully decreases with 
age. Children do seem to play a variety of roles 
quite overtly: “I’m Daddy” or “I’m the cat.” 
This type of imaginative play diminishes and 
finally disappears as the child grows older. 
Perhaps as the “self” role becomes more clearly
differentiated, the child loses the ability to
change perspective easily and becomes, as Kou- 
nin [3] has suggested, increasingly more rigid 
with age. It may be debated whether this type 
of role play should be termed “empathic,” par- 
ticularly as empathy implies an accurate ex- 
periencing of the phenomenal field of the other 
to the extent of permitting correct prediction 
of his behavior. 

3. It is also possible that empathy and age 
are not so simply related but, as is true of so 
many other psychological processes, follow 
some sort of growth curve of development. In
this case empathy would rise rapidly early in
the developmental history, maintain a high 
level once it had reached maturity, and finally 
start to decline with the onset of old age and 
the egocentricity which so frequently accom- 
panies it. 

4. Perhaps, fourthly, there is no consistent 
relation of the two at all, empathy being more 
importantly related to other variables than to 
age. 

To explore the relation between empathy 
and age, a small study was designed to meas- 
ure and compare the empathic ability of two 
groups of children who differed in age. 


Procedure and Subjects 


The samples for this study consisted of two 
groups of children drawn from the same rural 
public school. The backgrounds of the chil- 
dren were relatively homogeneous. Twenty- 
four children, 17 boys and 7 girls, all of whom 
were second graders, were compared to 16 
sixth-grade children of whom 10 were boys 
and 6 were girls. The mean age of the second- 
grade sample was 7.2 and of the sixth graders 
11.3 years. 


Test 1—The Projective Test 


The first problem was to design techniques 
for measuring the empathic ability of these 
children, because the previously reported de- 
vices [1, 2] were much too complicated for 
use with the younger subjects. Since empathy 
was defined as the imaginative and accurate 
transposing of oneself into the thinking, feel- 
ing, and acting of another [1, p. 127], it 
seemed to the investigators that a projective
test modeled on the Murray and Morgan Thematic
Apperception Test [4] might be one
fruitful way of tapping the ability. In a pre- 
test it was found that the TAT itself was too 
unstructured for the younger children. A 
modified TAT was therefore devised. Nine 
pictures showing children with and without 
adults in a variety of situations were cut from 
magazines and mounted on cardboard of a 
standard size. These pictures ranged, as do 
the TAT cards themselves, from vague draw- 
ings to photographs. 

The pictures were shown to each child one 
at a time. In each case the child was given a 
simple standard story designed to familiarize 
him with the stimulus material. The subject 
was then asked a standard series of questions 
to elicit from him the thoughts and feelings 
of the characters in the story. It was hypothe- 
sized that children would vary in their ability 
to see things from the viewpoint of others, and 
that a child with high empathic ability would 
have a store of this kind of experience, as a re- 
sult of previous actual role-taking experiences. 
Conversely, a child with low empathy would 
be less able to take the role of others in the fic- 
titious social situations, as he would not have 
a backlog of thoughts and feelings of others 
incorporated from past face-to-face experiences. 

After the experimenter had read the story 
for a particular picture, the child was asked 
to “tell about the people,” “what do you suppose
they are thinking and feeling?” He was
allowed to answer freely. If no thoughts and 
feelings were included in his remarks, he was 
then asked a graded series of questions about 
each character in turn. A sample story and set 
of questions follows. 


The picture shows a dining room scene. The 
mother is seated at one end of the table. She is look- 
ing very cross and pointing to a boy who is about 
nine years old. He is standing as if he has just risen 
from his place and is looking at his very dirty 
hands. The father sitting at the opposite end of the
table is surreptitiously looking at his hands under
the table.


Story: Billy has been playing in the sand lots 
all day. He was having such fun that he didn’t 
come home for supper until his mother had called 
him five times. Finally he came running in and 
sat down at the table. As he picked up his fork to 
eat his mother saw his dirty hands and sent him to 
wash them. His father is now looking at his own 
hands which aren’t very clean either. 


Question I-1. Tell me about the people in this
picture.
I-2. What do you suppose they are
thinking and feeling?
I-3. Try to describe them as they seem
to you.


If the thoughts and feelings of all the characters 
were mentioned in response to these general ques- 
tions no further questions were asked. If not, the 
following questions were posed until that point was 
reached. 


II-1. Tell me about Billy.
II-2. What is he thinking now?
II-3. How is he feeling?

III-1. Tell me about the mother.
III-2. What is she thinking now?
III-3. How is she feeling?

IV-1. Tell me about the father.
IV-2. What is he thinking now?
IV-3. How is he feeling?


A score was given for each character ac- 
cording to the amount of prodding needed to 
bring out a statement as to their thoughts and 
feelings in the described situation. Five points 
were given for each character with whom 
some empathy was shown in response to I, the 
general questions; 4 points were assigned if a
statement about the thoughts and feelings of
a character was forthcoming in response to
II-1 (the first question about a specific character);
3 points if empathy was shown in response
to II-2; 2 points if it was displayed in
response to II-3; and 1 point was given for
no empathy with the character. Since each 
child was told stories concerning a total of 
twenty-four people, the minimum score was 24. 
The maximum score was not as certainly fixed, 
as the child might introduce more characters 
into the stories than those mentioned in the 
standard stories. The scores were not derived 
by the person who had recorded the children’s 
replies. 
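
The scoring rule just described may be sketched as follows. The level labels (`general`, `first`, `second`, `third`, `none`) are names introduced here for the five cases in the text, and the sample record is hypothetical.

```python
# Points for the least amount of prodding that brought out a statement
# of a character's thoughts and feelings. The labels are assumptions:
#   general -> the general questions (I)
#   first   -> first specific question about the character (e.g. II-1)
#   second  -> second specific question (e.g. II-2)
#   third   -> third specific question (e.g. II-3)
#   none    -> no empathy shown with the character
POINTS = {"general": 5, "first": 4, "second": 3, "third": 2, "none": 1}


def projective_score(levels):
    """Total score for one child, given one level per character."""
    return sum(POINTS[level] for level in levels)


# A child who shows no empathy with any of the 24 standard characters
# receives the minimum score.
print(projective_score(["none"] * 24))  # 24

# Hypothetical record for the three characters of the sample story.
print(projective_score(["general", "first", "none"]))  # 10
```

Characters that a child adds to a story would simply lengthen the list, which is why the maximum score is not fixed.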

Results of projective test. The mean score
of the second-grade group was 51.3, and the
range from 24 to 95. The mean of the sixth-grade
group was 94.5, and the range from 43
to 141. The difference between these means is
significant beyond the 1 per cent level using t.

Do the scores mean that empathic ability 
increases significantly over this age period, or 
can the difference be explained on other 
grounds? It seemed possible that the increase 
was due to the development of verbal facility, 
but a careful analysis of the individual records
established that this was not the case. Some
records of second graders actually contained 
a greater amount of verbalization than those 
of the sixth graders, but these records were 
more concerned with a description of external 
details than with the internal thoughts and 
feelings. A definite qualitative difference is ap- 
parent in the records of the two groups; the 
sixth graders have obviously become more 
aware of, or at least more articulate about, 
the inner world of thoughts and feelings of 
others. 

Although the results of this testing were 
provocative, they could hardly stand by them- 
selves as a valid measure of empathy. There- 
fore another attempt was made to test this 
ability. As the projective test had made use of 
fictitious situations, it might be argued that 
the test was measuring creative imagination or 
some such other ability. For this reason the 
next test was designed to tap the children’s 
ability to understand their actual face-to-face 
relations with their peers. 


Test 2—The Social Insight Measure 


First, a sociometric technique was adminis- 
tered to both groups, which included three 
choice situations: whom would you like to sit 
beside, invite home to a party, and take with 
you when transferred to another room. Four 
answers were required to each question, three 
“like” choices and one “dislike.” Weighted
scores of +5, +3, +1, and —5 were given in 
that order. Each child, therefore, received a 
total score which was the sum of the total plus 
and minus mentions of him by the others. 
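
The weighting just described can be sketched in a few lines of Python. This is only an illustration: the ballot layout, the function name, and the children's names are hypothetical, and only the weights (+5, +3, +1, -5) come from the text.

```python
from collections import defaultdict

# Weights from the text: first, second, and third "like" choices,
# followed by the single "dislike" choice.
WEIGHTS = (5, 3, 1, -5)


def sociometric_totals(ballots):
    """Sum the weighted mentions each child receives from classmates.

    `ballots` maps a chooser's name to a list of answers, one per choice
    situation; each answer is a tuple (like1, like2, like3, dislike).
    This layout is an assumption made for the sketch.
    """
    totals = defaultdict(int)
    for chooser, answers in ballots.items():
        for answer in answers:
            for named, weight in zip(answer, WEIGHTS):
                if named == chooser:
                    raise ValueError(f"{chooser} cannot name himself")
                totals[named] += weight
    return dict(totals)


# Hypothetical ballots: two children answering the three choice situations.
ballots = {
    "Ann": [("Bob", "Cal", "Dee", "Eve")] * 3,
    "Bob": [("Ann", "Cal", "Dee", "Eve")] * 3,
}
totals = sociometric_totals(ballots)
print(totals)
```

A child named first in all three situations by one classmate receives +15 from that classmate, and a child named as disliked in all three receives -15, which matches the range the authors give later for a single rater.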

The sociometric test was followed by a re- 
verse sociometric or social insight test. Each 
child was required to judge the extent to which 
he was liked or disliked by each other member 
of the class. This task was thought to require 
the ability to take the role of the others, and 
to see himself from the viewpoint of each of 
them. The names of all the children in the 
class were listed with six choices following 
each: 


1. John Jones (1) likes me most, (2) likes me very 
much, (3) likes me some, (4) likes 
me only a little, (5) dislikes me, (6) 
dislikes me very much. 


An empathy score was based on the degree of 


R. F. Dymond, A. S. Hughes, and V. L. Raabe


correspondence of the ratings of how others 
felt, to those received on the sociometric rating 
made by the others. The score was obtained as 
follows: The range of possible sociometric 
scores that could be given by each child to any 
other child was from +15 to —15 (if he re- 
ceived all three +5 or all three —5 choices). 
This range was divided into six groups, each 
of which was arbitrarily equated to one phrase 
of the social insight test. 


+15 to +11    likes me most
+10 to  +7    likes me very much
 +6 to  +3    likes me some
 +2 to   0    likes me a little
 -5 to  -9    dislikes me
-10 to -15    dislikes me very much


Perfect correspondence of predicted liking and 
sociometric score was given a social insight 
score of 0. One point was given for each place 
that the predicted liking was removed from the 
sociometric rating. For example, if a child re- 
sponded that “John Jones likes me very much,” 
and John actually gave that child only a score 
of —5 on the sociometric test, his empathy 
score with John would be 3. It is obvious then 
that the lower the score the higher the corre- 
spondence, or the child’s ability to understand 
how others feel about him. 
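
As a rough sketch of this scoring rule, the following Python maps a sociometric score to one of the six phrases and counts how many places the child's prediction is removed from it. The function names are hypothetical, and the last two bands are taken as negative, which is what the stated lower limit of -15 implies. Scores of -1 to -4 fall in none of the published bands, so the sketch refuses them rather than guessing.

```python
# Sociometric score bands in category order:
# 1 = "likes me most" ... 6 = "dislikes me very much".
BANDS = [
    (11, 15),    # 1: likes me most
    (7, 10),     # 2: likes me very much
    (3, 6),      # 3: likes me some
    (0, 2),      # 4: likes me a little
    (-9, -5),    # 5: dislikes me
    (-15, -10),  # 6: dislikes me very much
]


def category_from_score(score):
    """Return the 1-6 category whose band contains the sociometric score."""
    for category, (low, high) in enumerate(BANDS, start=1):
        if low <= score <= high:
            return category
    # The published bands leave -1 to -4 unassigned.
    raise ValueError(f"score {score} is not covered by the published bands")


def insight_discrepancy(predicted_category, sociometric_score):
    """Places between the predicted phrase and the phrase actually earned."""
    return abs(predicted_category - category_from_score(sociometric_score))


def mean_discrepancy(pairs):
    """Average discrepancy over (predicted_category, sociometric_score) pairs."""
    return sum(insight_discrepancy(p, s) for p, s in pairs) / len(pairs)


# The article's own example: "likes me very much" (category 2) predicted,
# but John actually gave a score of -5 ("dislikes me", category 5).
print(insight_discrepancy(2, -5))
```

Averaging with `mean_discrepancy` corresponds to the conversion to means that the authors use to equate classes of different sizes.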

Again, the results show a clear advantage 
for the sixth-grade group. The scores were 
converted to means to equate for the number 
of others involved in the two groups. The 
range of the second-grade scores was from .66 
to 2.87, and the mean was 1.86. The range of 
the sixth grade was from .68 to 1.56, and the 
mean was 1.04. The difference between these 
means is also significant at the 1 per cent level. 

How are these two measures related? Even 
though there is a clear increase with age in 
the ability to deal successfully with both the 
projective and the social insight tests, are they 
measuring the same ability? The rank-order 
correlation between the two sets of scores is 
a low and positive +.46. While there is some 
overlap in the processes required in the two 
tasks, in that they both require the subject 
to penetrate the role of another individual, 
real or fictional, there are also differences 
which would lower the correlation. In the case 
of the projective test there is no criterion 
against which the statements are evaluated. 
Any statement of thoughts and feelings, appro- 




Measurable changes in empathy with age


priate or inappropriate, is accepted as evidence 
of empathy, the score being based entirely on 
how readily these are elicited by the stimulus 
materials and the tester’s prompting. In the 
other test, the accuracy with which the feeling 
of the other toward the subject is judged is 
taken into account in the scoring. In the first 
case, too, we are dealing with a relatively neu-
tral situation, that is, with pictures of people
not known by the subject. The second situa-
tion is far from neutral, and may be quite 
threatening to the child, in that he is asked 
to admit that some of his classmates dislike 
him, perhaps in some cases that the majority 
dislike him very much. Since it is easier to 
admit that one is liked than disliked, we would 
expect the social acceptability or sociometric 
scores to be more highly correlated with the 
social insight scores than with the projective 
test scores. This does in fact occur. The rank- 
order correlations are shown in Table 1. 


Table 1

Correlations of Sociometric and Projective Scores
and Sociometric and Social Insight Scores

             Sociometric-    Sociometric-
             Projective      Social Insight

Grade 2        -.32             +.26
Grade 6        -.03             +.50








Test 3—Friendship Qualities 

The fact that the correlation of the socio- 
metric scores and the social insight scores is 
not higher than +.26 for the second graders, 
raises another question: what are the qualities 
of those who are best liked at these two age 
levels? To get some insight into the bases of 
the friendship choices in the two grades an- 
other and final test was administered, consist- 
ing of twenty-five short descriptive phrases. 
The following instructions were given. 

Think of the children you like the best. Now 
from the list below choose the five things that tell 
about them the best. Draw a line under the words. 
Make sure you draw five lines. 

The phrases used were:

1. Cheerful
2. Good-looking
3. A good fighter
4. Feels sorry when others are in trouble
5. Is well dressed
6. Has lots of spending money
7. Is friendly
8. Shows off
9. Is tidy and clean
10. Has a nice home
11. Is understanding
12. Is always playing tricks
13. Quiet and shy
14. Talks a lot
15. Shares things
16. Thinks a lot of himself
17. Helpful to the teacher
18. Good at sports
19. Bossy
20. Has nice manners
21. Gets angry
22. Gets good marks
23. Helpful
24. Tough
25. Doesn’t get mad easily


Table 2

Friendship Qualities Chosen by the Second Graders

Rank in                                               Rank in
Grade 2    Item                                       Grade 6
   1.      Has a nice home                              12
   2.      Is good-looking                              12
   3.      Has lots of spending money                    9
   4.      Cheerful                                      2
   5.      Feels sorry when others are in trouble      [illegible]


Table 3

Friendship Qualities Chosen by the Sixth Graders

Rank in                                               Rank in
Grade 6    Item                                       Grade 2
   1.      Friendly                                      9
   2.      Cheerful                                      4
   3.      Is tidy and clean                            13
   4.      Has nice manners                             10
   5.      Shares things                                10











Tables 2 and 3 give the items that were 
chosen most often by the second and sixth 
graders and the comparative standing of these 
items in the other group. It seems clear that 
there are sharp differences between the two 
age groups in the bases for their friendships. 
The younger children put more stress on ex- 
ternals such as appearance and the material 
matters of nice home and money, while the 
older children have shifted more to an em- 
phasis on personality characteristics. The lat-
ter group of items seems to represent an in-
crease in socialization or at least the internali- 
zation of middle-class norms as they seem to 
describe the ideal-typical “nice little friend” 
from a parent’s point of view. If the older 
children are really choosing their friends more 
on the basis of their having warm personalities, 
it would make more understandable the dif- 
ference in the correlations between the empathy 
measures and popularity, as measured by the 
sociometric test. Since the second graders are
not choosing their friends on the basis of their
“understandingness” we can hardly expect 
those who are most popular to be high also 
in empathy. 

The correlations do increase, however, so 
that the children who are most popular in the 
sixth grade tend to be those with higher social 
insight. Since this finding does not also hold 
for the projective test, it should be interpreted 
rather cautiously. As children increase in age 
they are perhaps more aware of the thoughts 
and feelings of others, and better able to judge 
them accurately. Where the feelings of the 
others towards the subject are complimentary 
or easily acceptable to the self-image, they are 
more freely accepted and admitted. They are 
then not subject to the same pressures toward 
distortion for ego-protection as the damaging 
references would be. Individuals who are well 
liked may thus appear to be more empathic on 
a test of social insight. Those who are disliked 
may actually say they are liked, not through 
a lack of understanding of the feelings of 
others, but because of the difficulty in accept- 
ing the concept of self that such an admission 
would force. It is a different and harder task 
for the outsider to admit he is an outsider than 
for the insider to state his position accurately. 
The correlation of the sociometric test and the 
projective test scores shows that when the role- 
taking task is taken out of the realm of self- 
reference, there is no necessary correlation be- 
tween social acceptability and empathic ability. 
Both the “outsider” and the “star” of a social 
group may be either high or low in empathy 
as measured by the projective test. Not only 
is there the highly integrative, highly empathic 
group leader, but also the egocentric, domineer- 
ing, unempathic one who still may be courted 
by others for his power. Likewise, not all 
fringe members are insensitive. Some may be 
very understanding of others but lack the so-
cial skills to capitalize on their ability. One’s
place in a group seems to depend then, not so 
much on one’s role-taking ability per se but also 
on social techniques which vary with the struc- 
ture and function of the particular group. 


Summary 


In summary, two empathy measures, a pro- 
jective test and a social insight test, both of 
which required the subject to assume the role 




perspective of another, showed a marked in- 
crease with age from the seven-year to the 
eleven-year level. There was a low positive 
correlation between the two measures. Wide 
individual differences in ability were apparent 
within both age groups. The relation of social 
insight to popularity, as measured by socio-
metry, was positive and increased markedly
from the younger to the older age group. This 
finding might be interpreted as showing a high 
relation between empathy and popularity, were 
it not for the observation that the correlations 
of the sociometric test and the projective test 
remained near zero. Taking into account the 
nature of the social insight task, a more valid 
generalization seems to be that while, on the 
whole, children become more aware and better 
able to assess the feelings of others accurately, 
they also become more aware of which feelings 
are “safe” to recognize and admit, and which
need defenses. Children in the center of the 
sociometric chart are more free to be empathic 
than those on the periphery. 

Concerning the problem of the relation be- 
tween age and empathic ability, perhaps the 
only contribution of this paper is to demon- 
strate some of the difficulties involved. Most 
obvious are the problems of the measurement 
of empathy in young children. Although the 
results of both tests suggest a direct relation- 
ship between age and empathy from the years 
of seven to eleven, perhaps they only reflect the 
difficulty that younger children have in ver-
bal communication. The authors’ best estimate
is that, by and large, the empathic ability of 
children increases with age. 


Received October 22, 1951. 


References 


1. Dymond, Rosalind F. A scale for the measurement
   of empathic ability. J. consult. Psychol.,
   1949, 13, 127-133.

2. Dymond, Rosalind F. Personality and empathy.
   J. consult. Psychol., 1950, 14, 343-350.

3. Kounin, J. S. Experimental studies of rigidity:
   I. The measurement of rigidity in normal and
   feeble-minded persons. Charact. & Pers., 1941,
   11, 251-272.

4. Murray, H. A. Thematic apperception test:
   Manual. Cambridge, Mass.: Harvard Univer.
   Press, 1943.


The Identification and Measurement of Predispositional
Factors in Crime and Delinquency


Harrison G. Gough 


University of California, Berkeley 


and 
Donald R. Peterson 


University of Minnesota 


The value of a theory or concept resides in 
the range and scope of the facts it is capable 
of subsuming, and in the extent, number, and 
accuracy of the hypotheses it implies. The par- 
ticular analogies employed, the vocabulary 
utilized, and the palatability of the constructs 
themselves are all of secondary importance 
when set against this first criterion. 


The role-taking theory of psychopathy 
which the senior author outlined in an earlier 
paper [1] represents an attempt to meet these 
first requirements in a direct and straightfor- 
ward manner. Without restating the details of
this theory, it may be said that the role-taking 
theory did appear to synthesize and accommo- 
date all the known clinical and psychopatho- 
logical phenomena observed in psychopathy. 
Furthermore, it provided a basis for an infer- 
ential diagnosis, one freed from a barren de- 
pendence on any particular symptom or pat- 
tern of symptoms. 

The key element in the role-taking theory 
was an incapacity to look upon the self as a 
“social object” (where the terms “self” and
“social object” are used as defined by G. H.
Mead) and the resulting failure to elaborate
an adequate and realistic set of social expec- 
tancies and critiques. It is not contended that 
the psychopath, (as specified by role-taking 


1A preliminary version of this paper was read at
the annual meeting of the Western Psychological
Association at San Jose, California, April 28, 1951.

2This study is part of a larger project in personality
assessment being carried out at the University of
California, Berkeley, by the senior author. The
project is being supported, in part, by the National
Institute of Mental Health, National Institutes of
Health, U. S. Public Health Service.


theory), is deficient in role-playing ability, in 
the sense of being unable to dissimulate, to 
feign, and to deceive others. Unfortunately, 
the language of the social psychological analy- 
sis here employed occasionally gives rise to 
that misperception. On the contrary, what the 
theory asserts is that the capacity to build up, 
to sustain, to integrate, and to organize the 
residuals which ordinarily accrue as a conse- 
quence of interactional experience is lacking.3

Perhaps a word of caution should be inter- 
jected that we are not attempting to define the 
essential nature of psychopathy, or to explain 
just what psychopathy “really is.” The futility 
of such an enterprise is obvious. The present 
goal, rather, is to develop a workable, useful, 
and reliable theory which can, in fact, unify 
the generally accepted findings in the conven- 
tional use of the concept of psychopathy, and 
which can, moreover, be used in an efficient 
and practical way in understanding and pre- 
dicting the behavior of persons thus diagnosed. 

We should not, of course, neglect the possi- 
bility that the new perspectives opened up by 
the theory may suggest a change or modifica- 
tion in the diagnostic conventions previously 
employed. Such a revision has, indeed, oc- 
curred in the present instance. One of the ma-
jor implications of the role-taking theory is 
that the essential diagnostic factor is not so- 


3The theory, it should be mentioned, is neutral
with respect to the origin of this incapacity. It may
be constitutional, developmental, idiosyncratic,
reactive, etc., in origin. The present argument is
merely that the role-taking position is a heuristic
way of conceptualizing the behavior specified. (For
a discussion of the origins of role-taking skills, see
[3]).




cial maladjustment, friction, or delinquency, as 
so often assumed, but rather a deficiency in 
role-taking capacity which is peculiarly liable 
to manifestation in social interaction. 

In the present study, the task is to apply 
role-taking theory to the problem of delin- 
quent and criminal behavior. It is not assumed 
that delinquency and psychopathy are synony- 
mous—actually each can, and should be, de- 
fined independently of one another—or that 
all delinquents are psychopaths. The only as- 
sumption made is that the total delinquent 
and criminal population includes a large 
enough proportion of psychopathic types to 
make feasible the application of role-taking 
theory. 

The particular goal was to determine wheth- 
er a personality assessment device based on 
role-taking theory could be developed which 
would yield reliable and dependable predic- 
tions of delinquent and criminal behavior. Suc- 
cess, or partial success, in such an effort would 
represent a rather important step forward. Ac- 
cording to a systematic review by Schuessler 
and Cressey [4] of 113 studies in the last 25 
years attempting to differentiate criminals 
from noncriminals on the basis of personality 
tests, the results of such endeavors have been 
consistently negative. Their over-all conclu- 
sion was that, “The doubtful validity of many
of the obtained differences, as well as the lack
of consistency in the combined results, makes
it impossible to conclude from these data that
criminality and personality elements are asso-
ciated.” Concurrence with this opinion was
voiced in the last published paper of the noted 
criminologist, E. H. Sutherland [5]. 





Procedures 


The first step in the present study was the 
creation of a pool of personality test items 
which would incorporate the salient features 
of the role-taking theory, as well as items be- 
lieved on intuitive grounds to hold promise 
for the differentiation of delinquents from non- 
delinquents. The item “I often think about 
how I look and what impression I am making 
upon others” is an example of the former 
type, and the item “I used to steal sometimes 
when I was a youngster” is an example of the 
latter. This scale was then administered to a 
series of criminal, delinquent, behavior prob- 




lem, and control samples, as follows: 


Males

1. 43 high school boys from a small Minnesota
community.4
2. 125 high school boys from two Minneapolis
schools.5
3. 19 behavior problem boys from these three
schools.
4. 243 young delinquents committed to the
Minnesota Youth Commission.6
5. 698 reformatory inmates.7

Females

1. 44 high school girls from a small Minnesota
community.4
2. 134 high school girls from two Minneapolis
schools.5
3. 19 behavior problem girls from these three
schools.
4. 105 reformatory inmates.8


An item analysis of all control versus all de- 
linquent samples (one at a time, separately by 
sex) was carried out. Sixty-four of the items 
showed good differentiating power, and were 
retained for a delinquency “scale.” These
items, and the direction of response indicative 
of delinquency, are listed below: 


1. I get nervous when I have to ask someone
for a job. (F). 2. I sometimes feel that I made the
wrong choice in my occupation. (T). 3. I would
never play cards (poker) with a stranger. (F). 4.
I would have been more successful if people had
given me a fair chance. (T). 5. I think Lincoln
was greater than Washington. (F). 6. Life usually
hands me a pretty raw deal. (T). 7. A person is
better off if he doesn’t trust anyone. (T). 8. My
family has objected to the kind of work I do, or
plan to do. (T).

9. Sometimes I used to feel that I would like to
leave home. (T). 10. If the pay was right, I would
like to travel with a circus or carnival. (T). 11. I


4Arranged through the kindness of Mr. Meyer
Nelson of the Pillager, Minnesota, public schools.

5Arranged through the kindness of Mr. H. H.
Maass of Minneapolis South High School, and of
Miss Jane D. Lewis and Mr. R. V. Nord of Minneapolis
Vocational High School.

6Arranged through the kindness of Mr. A. Whittier
Day, Director of the Minnesota Youth Commission.

7Arranged through the kindness of Warden H.
B. Whittier of the Minnesota State Reformatory
for men.

8Arranged through the kindness of Mr. A. Whittier
Day and Warden Emerald A. Harper of the
Minnesota State Reformatory for women.




would do almost anything on a dare. (T). 12. As 
a youngster in school I used to give the teachers 
lots of trouble. (T). 13. My parents were too strict 
with me when I was a child. (T). 14. Even when 
I have gotten into trouble I was usually trying to 
do the right thing. (F). 15. My parents never real- 
ly understood me. (T). 16. My home life as a child 
was less peaceful than those of most other peo- 
ple. (T). 

17. I think I am stricter about right and wrong 
than most people. (F). 18. I often feel that I am 
not getting anywhere in life. (T). 19. Even the 
idea of giving a talk in public makes me afraid. 
(F). 20. It is very important to me to have enough 
friends and social life. (F). 21. I never worry 
about my looks. (T). 22. I hardly ever get excited 
or thrilled. (T). 23. I have very strong likes and 
dislikes. (T). 24. My parents have often disap- 
proved of my friends. (T). 

25. My home life was always happy. (F). 26. I 
often act on the spur of the moment without stop- 
ping to think. (T). 27. I seem to do things that I 
regret more often than other people do. (T). 28. I 
would rather go without something than ask for a 
favor. (T). 29. I have more than my share of 
things to worry about. (T). 30. I go out of my way 
to meet trouble rather than try to escape it. (T). 31. 
When I meet a stranger I often think that he is 
better than I am. (F). 32. It is pretty easy for peo- 
ple to win arguments with me. (T). 

33. Before I do something I try to consider how 
my friends will react to it. (F). 34. I have never 
been in trouble with the law. (F). 35. In school I 
was sometimes sent to the principal for cutting up. 
(T). 36. I keep out of trouble at all costs. (F). 37. 
I often think about how I look and what impres- 
sion I am making upon others. (F). 38. I find it 
easy to “drop” or “break with” a friend. (T). 39. 
I spend a good deal of time planning and thinking 
about my career. (T). 40. I enjoy work as much 
as play. (T). 

41. It is hard for me to act natural when I am 
with new people. (F). 42. When something goes 
wrong I usually blame myself rather than the other 
fellow. (T). 43. I have often gone against my par- 
ents’ wishes. (T). 44. I have never done any heavy 
drinking. (F). 45. I have been in trouble one or 
more times because of my sex behavior. (T). 46. 
When I work at something I like to read and study 
about it. (T). 47. Most of the time I feel happy. 
(F). 48. My table manners are not quite as good at 
home as when I am out in company. (F). 

49. I know who is responsible for most of my 
troubles. (T). 50. I get pretty discouraged with 
the law when a smart lawyer gets a criminal free. 
(F). 51. I have used alcohol excessively. (T). 52. 
When I was going to school I played hooky quite 
often. (T). 53. People often talk about me behind 
my back. (T). 54. I often feel as though I have 
done something wrong or wicked. (T). 55. I don’t 
think I’m quite as happy as others seem to be. (T). 
56. I used to steal sometimes when I was a young-
ster. (T).

57. I am somewhat afraid of the dark. (F). 58. I
never cared much for school. (T). 59. The mem- 
bers of my family were always very close to each 
other. (F). 60. I sometimes wanted to run away 
from home. (T). 61. With things going as they 
are, it’s pretty hard to keep up hope of amounting 
to something. (T). 62. My parents have generally 
let me make my own decisions. (F). 63. I was often 
punished unfairly as a child. (T). 64. My home 
life was always very pleasant. (F). 


The 64 items appear to group themselves 
into several rather distinctive clusters, such as 
the following: 

1. Role-taking deficiencies, insensitivity to 
interactional cues and the effects of one’s own 
behavior on others. 

2. Resentment against family, feelings of 
having been victimized and exploited in child- 
hood. 

3. Feelings of despondency and alienation, 
lack of confidence in self and others. 

4. Poor scholastic adjustment, rebellious- 
ness. 


Not all of the items are necessarily included 
in these particular four clusters. A more sys- 
tematic statistical analysis of item intercorre- 
lations might reveal additional, or perhaps dif- 
ferent, clusters. The four clusters listed, how- 


Table 1

Mean Scores on the De Scale for the
Criterion Samples

Samples                                  N        M       SD

I. Males
   A. Comparison
      1. School A                        43     18.70    5.11
      2. Schools B and C                125     22.67    6.13
   B. Delinquent
      1. School behavior problems        19     28.95    6.87
      2. Young delinquents              243     30.40    6.20
      3. Reformatory inmates            698     29.83    6.65

II. Females
   A. Comparison
      1. School A                        44     16.20    4.80
      2. Schools B and C                134     18.85    6.23
   B. Delinquent
      1. School behavior problems        19     22.00    7.83
      2. Reformatory inmates            105     30.26    5.63



ever, do appear to be clearly suggested by a 
subjective appraisal and can probably be ac- 
cepted as having clinical validity. 

The test answer sheets of all subjects were 
next scored for this 64-item key, with the re- 
sults indicated in Table 1. The progression of 
mean scores in Table 1 evidences a clear trend 
from lower scores for the control samples 
through intermediate scores for the behavior- 
problem samples to highest scores for the overt- 
ly delinquent samples. Analyses of variance
yielded F ratios of 62.5 for the male samples 
and 91.3 for the female samples. Both of these 
values are significant well beyond the .01 level 
of confidence. 
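
Because Table 1 reports N, M, and SD for every sample, the F ratios can be approximated from the summary figures alone. The sketch below is an illustration, not the authors' computation: it assumes a one-way analysis of variance across the samples of each sex and assumes the published SDs use the N - 1 denominator. The function name is hypothetical.

```python
def f_ratio_from_summary(groups):
    """One-way ANOVA F ratio from (n, mean, sd) triples.

    Assumes each sd was computed with the n - 1 denominator.
    """
    total_n = sum(n for n, _, _ in groups)
    grand_mean = sum(n * m for n, m, _ in groups) / total_n
    # Between-groups sum of squares: group means around the grand mean.
    ss_between = sum(n * (m - grand_mean) ** 2 for n, m, _ in groups)
    # Within-groups sum of squares recovered from each group's variance.
    ss_within = sum((n - 1) * sd ** 2 for n, _, sd in groups)
    df_between = len(groups) - 1
    df_within = total_n - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


# (N, M, SD) for each sample, taken from Table 1.
MALES = [(43, 18.70, 5.11), (125, 22.67, 6.13), (19, 28.95, 6.87),
         (243, 30.40, 6.20), (698, 29.83, 6.65)]
FEMALES = [(44, 16.20, 4.80), (134, 18.85, 6.23), (19, 22.00, 7.83),
           (105, 30.26, 5.63)]

f_males = f_ratio_from_summary(MALES)
f_females = f_ratio_from_summary(FEMALES)
print(round(f_males, 1), round(f_females, 1))
```

Under these assumptions the values come out close to the reported 62.5 and 91.3; small differences would be expected from rounding in the published means and standard deviations.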


Table 2

Screening Efficiency of the De Scale

Proportion of cases which would be called “delinquent”
with various cutting scores

                        Males                    Females
Scores            Delinquents  Controls    Delinquents  Controls
                    N=960       N=168        N=124       N=178

28 and over           63          17           60          7
27  "   "             68          19           65       [illegible]
26  "   "             74          23           73         12
25  "   "             79          31           79         17
24  "   "             83          39           83         20
23  "   "             87          45           85         2[?]
22  "   "             90          51           89         25





Combining all the delinquent and all the 
control cases, for each sex separately, permits 
the evaluation of the screening efficiency of the 
Delinquency Scale. Table 2 presents these 
data. For these samples it would be possible 
to identify as many as 75 per cent of the de- 
linquent and behavior problem subjects at the 
cost of only 23 per cent “false positives” for 
males and 12 per cent false positives for fe- 
males. A cutting score of 26 would correctly 
classify 78 per cent of the total sample of 1,430 
cases. The screening efficiency of the scale is 
thus quite good for these samples. 

The next step in the project was the obtain- 
ing of cross-validational evidence. Forty-two 
randomly chosen items, out of the total of 64, 
were administered to a sample of 1,092 in- 
coming recruits at Fort Ord, California.9 The


9Arranged through the kindness of Mr. Ernest
Belden, Mr. Jack Danielson, and Dr. J. H. Clark. 




Table 3

Comparison of Army Recruits and Stockade
Prisoners on an Abbreviated Form
of the De Scale

Scores        Recruits     Prisoners

33-35                          2
30-32                          5
27-29             5            9
24-26            20           13
21-23            48           18
18-20           102           15
15-17           163           17
12-14           284           14
 9-11           276            4
 6- 8           156            2
 3- 5            37
 0- 2             1

N:            1,092           99
M:            12.75        20.01
SD:            4.66         6.11



same item subset was also given to 99 stock- 
ade prisoners who were tested in a group on 
a single day. The distributions of scores for 
these two samples are given in Table 3. The 
difference between the two means is 7.26, and 
the critical ratio 11.52, again significant well 
beyond the 1 per cent level. 
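
The critical ratio can be checked from the figures in Table 3. The sketch below assumes the ratio is the difference between the means divided by its standard error, with the two variances left unpooled; that is an assumption about the authors' method, and the function name is hypothetical.

```python
import math


def critical_ratio(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Difference between two means divided by its standard error.

    Uses unpooled variances, an assumption about how the ratio was formed.
    """
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return abs(mean_a - mean_b) / standard_error


# Prisoners (M = 20.01, SD = 6.11, N = 99) against
# recruits (M = 12.75, SD = 4.66, N = 1,092), from Table 3.
army_cr = critical_ratio(20.01, 6.11, 99, 12.75, 4.66, 1092)
print(round(army_cr, 2))  # 11.52
```

Under this assumption the computed value agrees with the critical ratio of 11.52 given in the text.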


Table 4

Cross-validational Screening Efficiency of
the De Scale

Proportion of cases falling above various cutting scores

Scores              Army         Army
                  Recruits     Prisoners
                  N=1,092        N=99

20 and over           9            52
19  "   "            12            57
18  "   "            16            63
17  "   "            22            66
16  "   "            26            71
15  "   "            31            80
14  "   "            38            84





The cross-validational screening efficiency of 
the De scale is indicated in Table 4. The cut- 
ting score proposed for the original samples 
was 26, out of a possible 64. In the abbreviated 
version used in this cross-validation, a cutting 
score of 17 would be proportional. A division 




at 17 would properly identify 66 per cent of 
the prisoners and would misclassify 22 per 
cent of the controls. The corresponding figures 
for the male subjects in the original samples 
were 74 and 23, so it can readily be seen that 
the amount of “shrinkage” in cross validation 
has been remarkably small. An inspection of 
the other proportions in Table 4 will suggest 
various ways of using the De scale. For ex- 
ample, if one wished to minimize false posi- 
tives, but desired to identify at least 60 per 
cent of known prisoners, then a cutting score 
of 18 would be employed.10
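
The Table 4 proportions can be recovered from the grouped frequencies of Table 3 wherever a cutting score falls on the lower edge of a score interval. The sketch below is illustrative only. It reads the two highest intervals (33-35 and 30-32) as containing prisoners only, an inference from the fact that the column totals then equal the stated Ns; the names are hypothetical.

```python
# Table 3 frequencies keyed by the lower bound of each 3-point interval.
RECRUITS = {27: 5, 24: 20, 21: 48, 18: 102, 15: 163,
            12: 284, 9: 276, 6: 156, 3: 37, 0: 1}
PRISONERS = {33: 2, 30: 5, 27: 9, 24: 13, 21: 18,
             18: 15, 15: 17, 12: 14, 9: 4, 6: 2}


def percent_at_or_above(freqs, cutting_score):
    """Per cent of cases scoring at or above the cutting score.

    Only meaningful when cutting_score is the lower bound of an interval,
    because the grouped data cannot be split within an interval.
    """
    if cutting_score % 3 != 0:
        raise ValueError("cutting score must be an interval lower bound")
    total = sum(freqs.values())
    above = sum(n for lower, n in freqs.items() if lower >= cutting_score)
    return round(100 * above / total)


for cut in (18, 15):
    print(cut, percent_at_or_above(RECRUITS, cut),
          percent_at_or_above(PRISONERS, cut))
```

For cutting scores of 18 and 15 this reproduces the corresponding Table 4 entries (16 and 63 per cent; 31 and 80 per cent), which lends some support to the reading of the two tables used here.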

A second cross-validational sample was ob- 
tained when a 58-item version of the scale was 
administered to 353 stockade prisoners at the 
Lackland Air Force Base, Texas.11 In this 58-
item version a cutting score of 24 would be
proportional to the score of 26 used for the 
total test. Sixty-four per cent of the Lackland 
prisoners scored at or above 24, and 36 per 
cent below. A control sample was not avail- 
able at Lackland, but it was possible to deter- 
mine from the stockade records whether or
not a prisoner had any previous military of-
fenses. The “first offense” group included 209 
prisoners, and the repeaters numbered 144. 
The means and standard deviations were 25.74 
and 6.55 for the first offenders, and 28.19 and 
7.03 for the recidivists. The difference be- 
tween the two means was 2.45, critical ratio 
3.31, a value significant well beyond the .01 
level of probability. This difference, with the 
recidivists having higher scores, again confirms 
the theoretical expectations. 
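
The proportional cutting scores quoted for the shortened forms follow from simple scaling of the full-scale score of 26 out of 64 items. A minimal sketch, with a hypothetical function name and assuming rounding to the nearest whole score:

```python
def proportional_cut(full_cut, full_items, short_items):
    """Scale a cutting score to a shorter form, rounded to a whole score."""
    return round(full_cut * short_items / full_items)


cut_42 = proportional_cut(26, 64, 42)  # 42-item Army form
cut_58 = proportional_cut(26, 64, 58)  # 58-item Lackland form
print(cut_42, cut_58)  # 17 24
```

These agree with the cutting scores of 17 and 24 that the text treats as proportional to 26.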

The validity of the scale in both the original 
and the cross-validational samples thus ap- 
pears to be adequate.12 The scale does differ-
entiate delinquent from control samples, in 
terms of both mean differences and per cent 
overlap. The evidence from the Army samples, 
in fact, suggests that the scale possesses prac- 
tical screening efficiency. Furthermore, mili- 


10The particular cutting scores discussed here
would refer only to the abbreviated 42-item version 
used in the Army investigation. The expectation 
would be, however, that cutting scores for the total 
scale would be proportional to these. 

11These subjects were tested through the courtesy 
of Dr. A. S. Levine, and Major Grant Mann. 

12The only evidence for reliability so far obtained 
is an uncorrected split-half r of .72 for a random 
sample of 200 of the male reformatory inmates. 


tary recidivists score significantly higher, on the 
average, than first offenders. 

These findings with respect to validity may 
now be reviewed briefly in reference to the 
role-taking theory upon which the largest por- 
tion of the items was based. The theory dic- 
tated the exploration of new areas of item con- 
tent, with special emphasis on social interac- 
tion and expectancies. Many of the items de- 
rived from these considerations were pheno- 
typically unrelated to criminal and delinquent 
behavior; that is, they made no reference to 
legal or illegal acts, rules, acceptance of au- 
thority, and the like. Yet, in spite of this pheno- 
typical irrelevance, items such as these did, in 
fact, turn out under empirical analysis to re- 
late to delinquency in the manner specified by 
the theory. Evidence such as this, perhaps par- 
ticularly the practical screening efficiency of 
the scale, constitutes a strong argument for 
the utility and pragmatic value of the
role-taking theory of psychopathy.13

Any number of interesting follow-up stud- 
ies are suggested by the results of the present 
investigation. For example, one would predict 
that low-scoring prison inmates on the De 
scale would be better parole risks than higher- 
scoring inmates. Or, a systematic comparison 
of nondelinquent but high-scoring subjects ver- 
sus equally high-scoring and overtly delinquent 
subjects could be made. One would expect to 
find some interesting Rosanoff-type control 
factors in the former group. Then there is 
always the possibility of a configural analysis 
of the dimension isolated by the De scale with 
other variables. Consider, for example, the dif- 
ferences one would expect to find between sub- 
jects high on the De scale and on a femininity 
index, and subjects high on De but low on the 
femininity index. 


13A conclusion such as this is not intended to
rule out the possibility of accounting for the obser- 
vations by other theories and points of view. The 
unique contribution of the role-taking theory in this 
instance was that it indicated a specific new area 
of item content, one which turned out, under analy- 
sis, to possess considerable validity. Once these re- 
lationships have been demonstrated, their meaning 
and significance can undoubtedly be fitted into and 
explicated by other viewpoints. The value of the 
role-taking theory is not that it is the only one 
which can “explain” these relationships, but rather 
that it led directly to their discovery and
identification.








Adjectives 

In using a scale such as the one for delin- 
quency described here, or any other instrument 
for that matter, one of the needs of the user 
is to know what significance to attach to high 
and low scores, over and above the general 
implications which derive from the test’s va- 
lidity and reliability. 

In an attempt to provide information of 
this type for the De scale, an exploration of 
the “social stimulus values” of high and low 
scores was undertaken. That is, the question
was raised, “What sort of impression can a 
person who scores either high or low on the 
De scale be expected to make on his friends 
and associates?”


In order to accomplish this phase of the 
study, the scale was administered to 40 sub- 
jects who were participating in an intensive 
project being conducted by the Institute of 
Personality Assessment and Research, at the 
University of California, Berkeley.14 From
this group of 40, the 10 highest and 10 lowest 
scoring subjects on the De scale were selected 
for special consideration. Then a systematic 
comparison was made of what observers had 
said about each subject, in order to discover 
consistent and reliable differences between the 
high and low scorers. 

During the assessment periods, each of six 
Personality Institute staff members had com- 
pleted an adjective check list [2] for each 
assessee, giving as complete and accurate a 
picture as possible of his reactions to and con- 
ceptions of the assessee’s personality structure 
and characteristics. The six “observer check
lists” for each assessee were then consolidated 
into a single “composite observer check list” 
by considering each adjective with checks by 
two or more of the staff as being “present,” 
and those with one or no checks as being “absent.”
The composite observer check lists for
each of the 20 designated assessees were then 
split into the two criterion subsamples and an 
item analysis of the adjectives was completed. 
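
The consolidation rule just described (an adjective counts as “present” when two or more of the six observers checked it) can be sketched as follows. This is an illustration only: the function name, the `min_checks` parameter, and the sample check lists are hypothetical and are not data from the study.

```python
from collections import Counter

def composite_check_list(observer_lists, min_checks=2):
    """Return the adjectives checked by at least `min_checks` observers."""
    counts = Counter()
    for checked in observer_lists:
        counts.update(set(checked))  # one vote per observer per adjective
    return {adjective for adjective, n in counts.items() if n >= min_checks}

# Hypothetical check lists from six observers for one assessee.
observers = [
    {"calm", "tactful"},
    {"calm", "frank"},
    {"tense"},
    {"calm", "tactful", "modest"},
    set(),
    {"frank"},
]
composite = composite_check_list(observers)
print(sorted(composite))
```

In this made-up case “tense” and “modest” are each checked by only one observer, so they are treated as “absent” in the composite.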

All adjectives showing a statistically signifi- 
cant difference between the two subsamples 


14The study referred to is devoted to an investi- 
gation of factors making for success in graduate 
training and later professional life, and is being 
carried out under a grant from the Rockefeller 
Foundation. 


Harrison G. Gough and Donald R. Peterson 


were selected, and are listed below: 


A. Adjectives checked more frequently about high-scoring subjects.

affected        emotional       sensitive
anxious         headstrong      tense
defensive       persevering     wary
dissatisfied    rebellious

B. Adjectives checked more frequently about low-scoring subjects.

calm            good-natured    obliging
considerate     helpful         patient
conventional    moderate        peaceable
dependable      modest          tactful
frank           natural         unassuming

The social stimulus value of higher scores
on the De scale appears to include an emphasis
on characteristics such as dissatisfaction,
rebelliousness, defensiveness, etc. On the other
hand, lower scores “come through” socially
as considerateness, dependability, moderateness,
patience, and tactfulness. High and low scores
on the De scale can, accordingly, be taken as
predictive of the two patterns of social stimulus
values indicated.


Summary 


A role-taking theory of psychopathy was ap- 
plied to the practical problem of the identi- 
fication and measurement of predispositional 
factors in crime and delinquency. An assess- 
ment device was constructed which was ca- 
pable of differentiating significantly between 
delinquents and controls in both original and 
cross-validational samples. 

Some possibilities for additional studies were 
suggested, and an analysis of the “social stim- 
ulus values” of the test instrument was con- 
ducted. 


Received October 10, 1951. 


References 


1. Gough, H. G. A sociological theory of psychopathy.
Amer. J. Sociol., 1948, 53, 359-366.

2. Gough, H. G. An adjective check list for use
in personality research. (To appear).

3. Sarbin, T. R. A preface to a psychological
analysis of the self. Psychol. Rev., 1952, 59, 11-22.

4. Schuessler, K. F., & Cressey, D. R. Personality
characteristics of criminals. Amer. J. Sociol.,
1950, 55, 476-484.

5. Sutherland, E. H. Critique of Sheldon’s “Varieties
of Delinquent Youth.” Amer. sociol. Rev.,
1951, 16, 10-13.


Test Versus Academic Performance in
Malfunctioning Students


Barbara A. Kirk 


Counseling Center, University of California, Berkeley 


Students whose academic achievement is not 
commensurate with their capacities have been 
the recipients of much concern and study. The
assumption is prevalent that they are “maladjusted.”
It is the purpose of this paper to describe
the basic symptomatology when the discrepancy
between achievement and capacity is
severe and chronic, and to depict the related 
problems of measurement of capacity in stu- 
dents who are not performing well academic- 
ally, and whose test performance would there- 
fore be clinically suspect. 

Earlier concepts of the effect on mental ca- 
pacity of emotional illness have largely been 
discarded. In psychiatric texts as late as the 
third decade of the twentieth century, it is 
stated that psychoses such as dementia praecox 
caused, resulted in, or were associated with 
permanent mental deterioration. It is also im- 
plied that emotional illness of whatever sever- 
ity affected mental functioning in such a way 
as to impair it. Accumulated experience of 
both psychological and psychiatric nature has 
caused most practitioners to recognize and ac- 
cept the tenet that emotional maladjustment or 
illness may affect the patient’s performance or 
use of his mental capacity, but not actually re- 
duce his endowment. 

In determining intellectual capacity through 
testing, it has been erroneously supposed that 
timed “paper and pencil” tests of mental abil- 
ity or academic aptitude represent an individ- 
ual’s ability to function at a given time, but 
that an individual verbal test of intelligence, 
clinically administered with clinical interpre- 
tation, represents the best estimate of the indi- 
vidual’s native capacity. The discrepancy be- 
tween an adult’s ability to perform in these 
two situations could be thought of as an esti- 
mate of his degree of maladjustment, or of his 


anxiety or state of emotional tension at the 
time of administration of these two kinds of 
measurement. Thus it is expected that results 
on a timed “paper and pencil” test of ability 
will more closely parallel academic performance
than those on a test more representative
of true capacity. 

That “paper and pencil” tests of academic 
aptitude will not accurately predict academic 
performance for the individual is well known. 
There is extensive literature on the relation- 
ship of such a venerable test as the American 
Council on Education (Thurstone Psycholog- 
ical) Examination with academic grades as the 
criterion of academic performance, showing 
correlations from .45 to .55. It is presumed, 
although there has been no investigation, that a 
test of academic aptitude requested by a stu- 
dent who has voluntarily sought counseling in 
a student counseling service should bear a clos- 
er relation to performance than one taken in a 
mass testing program with uneven motivation 
and in all probability without optimum per- 
sonal or physical test conditions. 

Contrary to these clinical assumptions, psy- 
chological and psychiatric clinics and counsel- 
ing services on college campuses are not at all 
unfamiliar with the poor college achiever, 
whose maladjustment is apparent and rather 
severe and who, nevertheless, is outstanding 
in any sort of test performance. This appears 
to be a curious contradiction, but its recurrence 
compels attention. 

It is expected that an underachieving stu- 
dent whose capacity is not in question will do 
well on tests of intellectual capacity or academic
aptitude. It is not, however, the usual
pattern that he will do outstandingly well on 
“paper and pencil” tests. When this occurs, as 
it does, not too infrequently, one may conjec- 




ture that some special explanation may be in- 
volved. 

In reviewing a series of such cases, certain 
impressions are derived. Certain statements 
appear so recurringly in the interview or case 
summaries that one finds oneself searching for 
the customary report. In these records the aca- 
demic deficiency is always emphasized. This is 
no borderline deficiency, but rather a severe 
academic deficiency, serious enough to have oc- 
casioned administrative action such as a place- 
ment on probation. The counselee demonstrates 
no real recognition or admission of the reasons 
for this deficiency, but, on the other hand, he 
evidences excellent insight into his abilities, and 
no surprise at the results of the tests of aca- 
demic aptitude. The explanation and excuses 
for the academic deficiency are unrealistic, 
superficial, and largely implausible. This com- 
bination of reactions may be epitomized as the 
counselee’s echoing of what he may have heard 
—that he does not know why he is doing 
poorly, because he knows he has the ability. He 
may or may not be concerned or anxious about 
his situation, but he is still unaware of the 
reasons for it. He may also be surprised that 
he was not tense or bothered on tests admin- 
istered to him during counseling because he 
frequently has been tense or bothered during 
academic examinations. If pushed in counsel- 
ing, the counselee may admit that this could 
be willful failure, but will protest that it is 
completely unknown to him, and, as a matter 
of fact, this explanation does not sound rea- 
sonable to him. 

In some cases there have been previous at- 
tempts at therapy. If so, the counselee will 
say that the therapy did not get at the root of 
the trouble, and that he is really unable to re- 
member much of what was discussed in ther- 
apy. 

Universally in these cases, the counselor re- 
ports that it was a matter of extreme difficulty 
to obtain any direct discussion of the problem 
with the counselee, however obvious and ap- 
parent the problem, and however voluntarily 
the counselee had sought counseling. More- 
over, the recurrent report is that it was ex- 
tremely difficult to obtain a description or dis- 
cussion of any of the counselee’s feelings, or 
even in many cases, situations or vicissitudes 
which might be expected to occasion strong 




counselee reactions. 

A case has been selected for presentation as 
illustrative. The student was counseled at the 
Counseling Center of the University of California,
Berkeley, by Mr. Arthur L. Traphagen,
a Senior Counselor. Unlike many such cases, 
the counselor here was finally successful, after 
long and skillful effort, in securing a verbali- 
zation of what has been hypothesized to exist 
in other cases. 

In all probability, the resistance and the repression
ordinarily noted are directed towards
a strong effort to control an excess amount of 
hostility. The academic failure probably has 
meaning in terms of unconscious satisfaction 
of the hostility usually directed towards some 
member of the family who demands success, 
while the excellent scores obtained on tests 
taken in a counseling situation may be inter- 
preted also as hostile gestures. Because no im- 
portance in satisfying administrative require- 
ments is attached to these tests, the counselee 
is free to do with them as he wishes. It is a 
declaration, perhaps, of the lack of significance 
of his academic failure, and a flaunting of this, 
that permits him to do a superb job in his 
testing during the counseling process. He has, 
in many of these cases, come voluntarily for 
counseling so that, in all probability, the coun- 
selor is also involved in this acting-out process. 
He has perhaps undertaken to show the coun- 
selor how great the problem is and how much 
he dares him to approach it. 

The psychiatric service of the U. C. Stu- 
dent Health Service recognizes this syndrome. 
The impressions of Dr. Harold Renaud, Chief
Psychologist, regarding the Minnesota Multi- 
phasic Personality Inventory records in these 
cases are: 


On the basis of experience with 1500 records col- 
lected here [the Student Health Psychiatric Serv- 
ice], the kinds of MMPI records most frequently 
associated with poor academic achievement are as 
follows: 

Most frequent of all is what we have here desig- 
nated as “psychoneurosis with compulsive and de- 
pressive features.” (For us such a record is char- 
acterized by a mild K elevation, a distinct D spike, 
and Pt as high, or higher than Sc. In general we 
would judge the degree of difficulty in productiv- 
ity to increase rather directly until scores of 80 and 
above are met with, at which point immobilization 
frequently becomes of incapacitating degree.) So 
far as the dynamics of such patients are concerned, 




they tend to be pervasively resistant on an uncon- 
scious level to any externally imposed task. Since 
childhood, however, they have concealed such re- 
sistances from themselves and others by a facade 
of hard-workingness, meticulousness, and earnest 
dutifulness. They are frequently able to “keep going”
through high school, but in the unstructured
environment of a university, the loss of the con- 
tinued external pushing of teachers and parents 
permits the overthrow of the process of grudging 
achievement, and the resistances then manifest them- 
selves in nonperformance concerning which the 
patient is very self-derogatory. (This group is well 
described in Horney’s section on work inhibition.) 
The opinion so frequently expressed in the litera- 
ture that compulsive personalities require protracted 
treatment coincides with our experience here. 

The second most frequent group of nonperformers
here at the clinic seem to have records which
we designate as “hysterical personalities.” For us 
such a record is characterized by a distinct eleva- 
tion at K, frequently a moderate elevation of L and 
no important emphasis on F. The left side of the 
record is characterized by an Hy spike with the 
remainder of the record being of indifferent eleva- 
tion. Such patients, in contrast to the first group, are 
not characterized by depression, or self-derogation, 
but are rather prone to the repression and suppression
of any emotional problems. They picture themselves
as coming from “just normal, average families,”
and tend to be devoid of any but superficial
complaints. They naively exclude from awareness 
the hostile potentialities of social interactions by 
maintaining that their relations with their friends, 
professors, etc., are “just fine.” Because of their 
naive tendency to “accentuate the positive,” such 
patients are ordinarily very poorly motivated for 
treatment, and from this point of view are able to 
make less use of psychotherapy than members of 
the first group. 


The following illustrative case falls mainly 
in the second group. 


The Case of Mr. I 


Mr. I is 21 years old, friendly, and nice appear- 
ing. He is a third-year student in the College of 
Chemistry in chemical engineering and is appre- 
hensive that he might be dropped from its roll. He 
implied that he sought counseling of his own volition
because of inability to achieve academically
when he knew he was capable of good work. Ac- 
tually, the counseling was brought about under 
pressure from the Dean of Students Office, after 
Mr. I’s mother had conferred with the Dean sev- 
eral times. In October, 1948, he had contacted the 
Dean of Students because his academic life had 
gotten into a “terrible mess,” he was despondent, 
and was afraid he might break mentally. He 
was referred to the Psychiatric Service of the Stu- 
dent Health Service, which recommended he be 
permitted to withdraw from the University. Subse- 


quently, at his mother’s insistence, he saw a psychi- 
atrist for three visits and was told, he reported, 
that nothing was wrong and psychiatric treatment 
was not warranted.

He re-entered the University in February, 1949. 
An exploratory testing program was planned at the 
Counseling Center, including the Strong Interest 
Blank, the ACE Psychological Examination, 1946 
College Edition, the Ohio State Psychological Examination,
1936 Edition, the Stanford Scientific
Aptitude Test, and the Wrenn Study Habits Inventory.

The interview following testing began with Mr. 
I stating he felt the tests were probably of no help, 
that he anticipated the Strong would confirm aca- 
demic work in the College of Chemistry, and the 
tests of capacity or ability would show very high 
scores, as they always did. He had evaluated his 
performance correctly, scoring at the 99th percentile
on the ACE for male college freshmen nationally,
and 99th percentile on the Ohio State as compared
with Ohio State University freshmen, with
a score on the Stanford Scientific Aptitude Test
comparable to a mean score for college science faculty.
When he was informed that he had guessed
the outcome of the tests, he became serious, appeared
to be concerned and anxious as to why he
could do well on tests like this and not obtain passing
grades academically. He was pressed for further
information concerning poor academic grades.
His first semester, with little study, he had obtained 
an A, a B, and three Cs. The next semester he had 
obtained two Cs, and two Fs. The following semes- 
ter he had received Cs and Ds, and from then on 
had stayed approximately in this category. 

The approach then taken by the counselor was to 
determine if possible why the man was in college, 
whether it was his own desire to be here, or whether
he was here under duress from his parents, and
if he had any explanation as to why he was not 
achieving. He stated it was both, but mainly be- 
cause he desired to be a research chemist, and rec- 
ognized the need of professional training and a 
college degree to attain this goal. The counselor 
then inferred that his proposed field was akin to 
his father’s (a civil engineer) and possibly he was 
attempting to emulate or compete with his father. 
He negated this too quickly, attempting to attribute 
his lack of achievement to a dislike for the regi- 
mentation found in the College of Chemistry. At 
this point the counselor stated he felt this was not 
the whole story, that to have the ability which he 
apparently had, yet not be able to perform, must be 
the result of some pressure or disturbance. If he 
had no inkling as to causative factors, it would be 
advisable to investigate. The Sentence Completion 
Test was described and its use was suggested as a 
means of obtaining his feelings and attitudes to- 
wards a variety of subjects. He expressed a will- 
ingness to cooperate, stating he would welcome any 
investigation as to cause. His sincerity was doubted, 










but the test was administered. 

The next time he was seen, Mr. I opened the 
interview with a question as to what the test 
showed. The counselor gave him the general im- 
pressions which the test presented, i.e., a person 
desiring success vocationally and socially, but
afraid to compete in these areas for fear of suc- 
ceeding because of the obligations such success 
would place upon him. He was not placing himself 
in situations where there was a chance to have his 
abilities honestly evaluated. Instead, he was refus- 
ing to achieve and refusing to meet the necessary 
competition, all the time rationalizing this refusal 
to reduce the guilt he felt for not succeeding when 
he knew he could. The counselor stated he felt 
many of the sentence completions were made with 
tongue in cheek. Mr. I laughed and agreed, then 
became serious when the counselor began to pick 
out specific statements for discussion, especially in 
the area of family. This was done to see if he 
would talk about family background and volun- 
tarily release information which the counselor had 
received from the Dean of Students. When these
statements had been read, he asked, “Did I say 
that?” He was shown the test in his own writing 
and said he had no idea he had made those state- 
ments. When questioned directly about relations 
with his father, he stated they were excellent “now.” 
Further probing revealed that his father had suf- 
fered two mental breakdowns, one when Mr. I 
was in junior high school, and again when he was 
a junior in high school. Having gone through this 
ordeal, with the father at home, he had developed 
fear that the same thing would happen to him if 
he patterned himself too closely after his father. 
A great deal of affect did not appear to be attached 
to these statements, so other items were cited. When
questioned about a brother, who is 25, has com- 
pleted only 8th grade, and is a neighborhood gar- 
dener, the key question had apparently been found. 
A surprising display of emotion, including hatred, 
guilt, and shame, was released as though the man 
had little control over what he was saying. He 
talked for a long time, concluding with the obser- 
vation that this was the first time he had ever, to 
his knowledge, permitted himself fully to express 
his feelings in an uncontrolled fashion, so that he 
himself had to face them, and that the counselor 
was the only person to whom he had ever talked 
about his brother. The content of the diatribe in- 
cluded the facts that his brother received a brain 
injury at birth. He referred to it as a speech in- 




jury, but then stated that this brother acts like a 
ten-year-old. The whole family feels guilty and 
will not admit that the injury is something which 
cannot be corrected by proper treatment. The moth- 
er feels guilty for having produced such a child. 
The father refuses to admit that a member of his 
family could be mentally defective, and in spite of 
having spent a fortune on physicians, psychiatrists, 
and speech pathologists, refuses to treat his son in 
any manner other than as if he were normal and 
fully capable of responsibility for his actions. This 
brother had been placed in the same public school 
as Mr. I, and as long as he was in school he had 
been taunted by the other children for having a 
“dummy” brother. Because of the actions of the 
father, Mr. I had tried to be his brother’s protec- 
tor. He now feels it was due to his own feeling of 
guilt and possible hatred for his brother who has 
deprived him of so much socially and economically. 
Because of this financial drain on the family, Mr. 
I has to finance his own schooling and support him- 
self in all ways except for housing. 

During his discussion with the counselor, he be- 
gan to wonder if his academic performance was 
not a form of rebellion and escape, an unconscious 
punishment of the father and family because they 
were so strongly looking to him to achieve for the 
family and thereby minimize his older brother’s 
inabilities. At the end of the interview, although 
he was stirred up, he felt considerably relieved in 
that he felt he was beginning to understand many 
things about himself which he had never under- 
stood before. He had begun to get insight into the 
nature of his problem and his actions and stated he 
was going immediately from here to the student psy- 
chiatric service to see if they would accept him 
and work with him in straightening out his feel- 
ings and attitudes because he knew full well that 
it was beyond his capabilities. 


The counseling implications are clear. De- 
spite the counselee’s resistance and rejection of 
the problem, it is the counselor’s responsibility 
to do everything in his power to carry it to 
him in such a way that the preparation may 
be made for psychotherapy. The counselor 
must do so in spite of the fact that the success 
of either counseling or therapy cannot be too 
optimistically regarded. 


Received June 11, 1951. 




Changes in Stanford-Binet IQ of Retarded
Six-Year-Olds


Mary Isabel Elwood 


Pittsburgh Public Schools 


Although the general reliability of the Revised
Stanford-Binet Scale has been well established,
evidence has been published indicating
atypical variability of IQ distributions at
certain chronological age levels. One of these 
levels occurs at five and six years, when the 
variability seems to be particularly low. Ter- 
man and Merrill [3, pp. 38-41], in the origi- 
nal publication of the scale, point out that, 
while the standard deviation of IQ’s may be
accepted generally as slightly over 16 points, 
the standard deviation at age six is 12.5 for 
Form L and 13.2 for Form M. For six-year- 
old children whose IQ’s are far above or below 
the average range, the IQ’s obtained are too
close to 100. Thus, superior children, if retested
later, would be expected to show considerably
higher IQ’s, and retarded children
considerably lower ones. McNemar [2, pp. 172-174]
has provided a table of IQ adjustments
for these variability differences. His table ad- 
justs obtained IQ’s of children of chronologi- 
cal age 4-10 to 6-6 and IQ 50 to 79, down- 
ward from 5 to 11 points. If a child in this 
chronological age range earns an IQ of 79, it 
should be adjusted to 73, while an obtained IQ 
of 50 at the same chronological age should be 
adjusted to 39. The general use of these ad- 
justments is recommended by Goodenough [1, 
p. 169], although they are probably not wide- 
ly used. 

Public school psychologists have an unusual 
opportunity to examine the validity of these 
corrections at the lower IQ levels, since the 
question of postponement of entrance to first 
grade provides occasion for retesting many chil- 
dren who were first tested at about age six. 

The school law of Pennsylvania permits 
school districts to refuse entrance to the first 
grade of any child whose mental age on an 


individual test administered by a (state) cer- 
tified public school psychologist or psychologi- 
cal examiner, is less than five years. Most of 
these children are retested before they are fi- 
nally admitted to first grade. Of those ad- 
mitted without a retest, many are tested some- 
what later for placement in a special class or 
for some other adjustment purpose. Since test- 
retest data for a considerable number of chil- 
dren are thus available in Pittsburgh, it has 
seemed worth while to determine to what ex- 
tent the second test supports McNemar’s rec- 
ommended adjustments. 


In compiling the following tables, records 
have been used for all children whose first test 
was given between the chronological ages of 
4-10 and 6-6, whose IQ at that time was be- 
tween 50 and 79, and whose second test was 
given at any chronological age above 6-6. The 
only records omitted were those of children 
who had marked physical handicaps which 
might invalidate test scores. So far as it is pos- 
sible to determine, no selective factor entered 
into the picture, except that of geographical 
location—they were all Pittsburgh children. 
All tests were administered by one of three 
experienced school psychologists, and usually, 
although there were exceptions, both tests were 
administered to the same child by the same 
examiner. In all cases, Form L was used for 
the first test. The retest form in some cases 
was L, and in some, M. 

The total number of children included in 
the study is 306. Of these, 77 had IQ's be- 
tween 50 and 59 inclusive on the first test, 
109 had IQ’s between 60 and 69, and 120 
had IQ’s between 70 and 79. Because the rec- 
ommended correction increases in magnitude 
as the obtained IQ moves farther from 100, 
these three groups are here studied separately. 













Table 1
Chronological Age at First and Second Test

                First test                Second test
IQ group    N   CA range       Mean CA    CA range       Mean CA
70-79     120   5-1 to 6-6     6-1        6-7 to 12-8    8-3.3
60-69     109   4-10 to 6-6    6-0.9      6-7 to 14-9    7-8
50-59      77   4-11 to 6-6    6-0.2      6-7 to 12-6    7-11.4

Total     306





Table 1 shows the chronological ages for 
the three groups at the first and second tests. 
Although the largest number of retests oc- 
curred within two years after the first test, it 
will be seen that there was a great deal of scat- 
ter in this respect. 


Table 2
IQ’s on First and Second Test

                First test        Second test
IQ group    N   Range    Mean     Range    Mean     Diff.    SD
70-79     120   70-79    74.9     59-96    73.6     -1.3     7.1
60-69     109   60-69    65.3     47-86    67.6     +2.3     7.2
50-59      77   50-59    55.2     37-73    56.9     +1.7     8.4
Total     306





Table 2 shows a comparison of IQ’s for the 
same three groups on the first and second tests. 


Table 3
Changes in IQ

IQ change      IQ 70-79    IQ 60-69    IQ 50-59    Total
+16 to +20          1           1                      2
+11 to +15          3           8           7         18
+6 to +10           9          23          15         47
+1 to +5           29          38          19         86
0                  15           5          14         34
-1 to -5           34          22          17         73
-6 to -10          19          10           4         33
-11 to -15         10           2           1         13
Total             120         109          77        306







It will be seen that, while differences in indi- 
vidual cases ranged from +19 to —15, the 
differences between the means are slight and 
not statistically significant. 

Table 3 shows distribution of changes in 
IQ for individuals in each group. Of the en- 
tire group, 63.1 per cent showed a difference 
of no more than five IQ points between first 
and second tests; while 89.2 per cent showed a 
difference of no more than ten IQ points be- 
tween the two tests. 
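
The two percentages above follow directly from the Total column of Table 3. The short check below is a sketch only: the band labels are one reading of the table's row headings, and the variable names are illustrative.

```python
# Total column of Table 3: band of IQ change -> number of children.
changes = {
    "+16 to +20": 2,
    "+11 to +15": 18,
    "+6 to +10": 47,
    "+1 to +5": 86,
    "0": 34,
    "-1 to -5": 73,
    "-6 to -10": 33,
    "-11 to -15": 13,
}

total = sum(changes.values())
within_5 = changes["+1 to +5"] + changes["0"] + changes["-1 to -5"]
within_10 = within_5 + changes["+6 to +10"] + changes["-6 to -10"]

pct_within_5 = round(100 * within_5 / total, 1)
pct_within_10 = round(100 * within_10 / total, 1)
print(total, pct_within_5, pct_within_10)
```

The counts sum to the 306 children in the study, and the two proportions round to the 63.1 and 89.2 per cent given in the text.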

Retest IQ’s averaged slightly higher for all 
three groups when the same form of the scale 
was used than when the retest was made with 
a different form. This seems to indicate a 
slight practice effect, even after a considerable 
time interval. These differences were: 1.2 
points at IQ 70-79, 1.98 points at IQ 60-69, 
and 1.90 points at IQ 50-59. None of these dif- 
ferences is statistically significant. 

The only group which showed a slightly 
lower average IQ on the second test was the 
70-79 IQ group, which also had a little longer 
interval between tests, suggesting a relation- 
ship between time elapsed and magnitude of 
second IQ. The effect of time interval has been 
studied for the entire 306 cases, and the resulting
r was found to be -.36 ± .05. Thus,
there is a low but statistically significant neg- 
ative correlation between time elapsed and 
magnitude of the second IQ. 
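
The article does not say which error term accompanies the r. As a rough check, and assuming the classical large-sample formula (1 - r^2)/sqrt(N) for the standard error of a correlation (an assumption on our part, as is the 0.6745 multiplier for the probable error), the sketch below shows which of the two is consistent with a value of .05.

```python
from math import sqrt


def standard_error_of_r(r, n):
    """Classical large-sample standard error of a Pearson r: (1 - r^2) / sqrt(n)."""
    return (1 - r * r) / sqrt(n)


def probable_error_of_r(r, n):
    """Probable error, taken here as 0.6745 times the standard error."""
    return 0.6745 * standard_error_of_r(r, n)


# Values reported in the text: r = -.36 on 306 cases.
se = standard_error_of_r(-0.36, 306)
pe = probable_error_of_r(-0.36, 306)

print(round(se, 3), round(pe, 3))
```

Under these assumptions the standard error rounds to .05 and the probable error to .03, so the reported figure looks like a standard error. An r of -.36 is then roughly seven standard errors from zero.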


Coefficients of correlation between first and 
second IQ have not been included, because of 
the restricted sample used. Not only is the 
group studied drawn from a very narrow IQ 
range, but the distribution of IQ’s within that 
range is not a normal one. 


Summary 


Three hundred and six Pittsburgh school 
children have been tested on the Revised Stan- 
ford-Binet Scale, first at a chronological age 
between 4-10 and 6-6, and later at a chrono- 
logical age of 6-7 or more. All of these children
had IQ’s between 50 and 79 on the first
test. The differences in IQ between the two 
tests were slight and not statistically signifi- 
cant. There was no consistent drop in IQ 
shown on the second test. 


Retests using the same form of the scale
showed slightly higher IQ’s than those using
the other form, which suggests a little more
practice effect than might be expected among
young retarded children.

There was a low but statistically significant
negative correlation between magnitude
of second IQ and time interval between tests.


Received August 3, 1951. 


References 


Goodenough, Florence. Mental testing. New
York: Rinehart, 1949.

McNemar, Q. The revision of the Stanford-
Binet Scale. Boston: Houghton Mifflin, 1942.

Terman, L. M., & Merrill, Maud A. Measuring
intelligence. Boston: Houghton Mifflin, 1937.








Psychoneurosis and Psychosomatic Reactions: 
A Rorschach Contrast¹,²


Leo Shatin 


Veterans Administration Hospital 
West Roxbury, Mass. 


Modern psychiatric practice takes cogni- 
zance of somatic reactions which are related 
to intrapsychic conditions [5]. These psycho- 
somatic syndromes are classified as somatiza- 
tion reactions and differentiated from other 
psychoneurotic categories. 


The present study compares the Rorschach 
productions of psychosomatic patients with 
those of patients showing (broadly) psycho- 
neurotic types of reaction. It investigates the 
hypothesis that distinct psychological differ- 
ences do exist between these two broad diag- 
nostic groupings, and attempts to determine 
differential psychological characteristics as re- 
vealed in projective (Rorschach) responses. 
The essential comparison is between two 
groups which have displayed personality dis- 
turbances; in one group this disturbance is 
associated with physiological dysfunction and 
even structural change, while in the second the
difficulty is manifested in non- or minimally 
somatic symptoms and intrapsychic difficulties 
diagnosed as psychoneurotic disorders. 


Procedure and Results 


The subjects were drawn from a larger series of 90 male veteran
patients in a general hospital, referred during 1948-1949 [3] for
psychological evaluation of emotional and personality difficulties.
Selection was made on the basis of the final diagnostic summaries.
From the subgroup of patients with psychoneurotic reactions all cases
were selected in which there was no indication of a somatization
reaction and no medical syndrome that might be considered
psychosomatic. The great majority of these were diagnosed as
psychoneurotic reaction types. Several were character disorders, but
they too displayed emotional disturbances of a psychoneurotic nature.
This group (PN) numbered 33 (Table 1).

¹ Condensed from a portion of a dissertation submitted to Harvard
University in partial fulfillment of the requirements for the Ph.D.
degree. Appreciation is expressed to Dr. R. W. White for his
suggestions and encouragement.

² Sponsored by the Veterans Administration and published with the
approval of the Chief Medical Director. The statements and conclusions
published by the author are a result of his own study and do not
necessarily reflect the opinion or policy of the Veterans
Administration.

















Table 1
Diagnostic Composition of Psychoneurotic Group (PN)

Diagnosis                                                   No.
Acute situational reaction                                    2
Anxiety reaction                                             13
Anxiety reaction with obsessional and phobic features         1
Character disorder (with psychoneurotic reactions)            2
Conversion reaction                                           2
Conversion reaction in an inadequate personality              1
Depressive reaction                                           2
Dissociative reaction                                         1
Hypochondriacal reaction                                      3
Involutional depressive reaction, recurrent                   1
Obsessive-compulsive reaction                                 1
Phobic reaction and conversion symptoms                       1
Psychoneurosis, mixed                                         2
Psychoneurosis with depressive moods                          1
                                                         N = 33





The psychosomatic group (PS) was drawn from among the psychosomatic
conditions, the patients showing somatic reactions in association with
emotional or psychic disturbance. All cases were dropped in which the
somatic condition played only a minor role with psychoneurosis as the
main diagnostic consideration, or in which the somatic condition was
not sharply delineated. This group numbered 24 (Table 2).














Table 2
Diagnostic Composition of Psychosomatic Group (PS)

Diagnosis                                                        No.
Bronchial asthma                                                   4
Irritability of colon                                              1
Neurodermatitis                                                    2
Peptic or duodenal ulcer                                           8
Psychogenic cardiovascular reaction with essential hypertension    1
Psychogenic gastro-intestinal reaction                             1
Rheumatoid arthritis                                               4
Somatization reaction, gastritis                                   1
Ulcerative colitis                                                 2
                                                              N = 24





The PS and PN samples do not include all the possible classifications
which might be subsumed under these headings, nor are the
subcategories equally represented. It should also be kept in mind that
the subjects of this study are individuals who found it necessary to
seek hospitalization, with the majority of the PS cases revealing
structural change. There may be more than a simple quantitative
difference between the person who successfully carries on the daily
tasks of living, albeit in somewhat limited fashion, and the person who
becomes so incapacitated as to require hospital treatment.


These two groups were equivalent in age, education, and intelligence
(Table 3). They were comparable for productivity in the Rorschach
test; there was no significant difference in total number of responses
(chi-square test of independence, p = .5).

The Rorschach tests administered to these 
patients were scored for 39 variables that pro- 
vided a cross section of Rorschach scoring cate- 
gories including the standard ones. The variables
are summarized³ below.


Rorschach Scoring Variables

1. W%
2. Dd%
3. S+s: Number of responses incorporating white space.
4. F+%
5. F%: FM and m are treated as F for this ratio.
6. M
7. FM
8. m
9. FC: Total number, without weighting.
10. Sum C, weighted
11. Uncontrolled color

    Definition                                     Score
    No color responses in protocol                   0
    FC present, but no CF or C                       1
    Number of FC ≥ CF+C                              2
    FC present, but fewer in number than CF+C        3
    CF and/or C present, but no FC                   4

12. C’
13. c
14. V: Excluding k.
15. k
16. Y: Excluding C’, c, k.
17. C’+c+V+k+Y: Total shading and achromatic emphasis.
18. M − Sum C: Relationship of M to Sum C, expressed as arithmetic
    difference. Negative score expresses greater color emphasis.
19. M + Sum C
20. FM+m:M: High score expresses emphasis on movement other than
    human; low score expresses greater relative emphasis on human
    movement.


³ Detailed specifications are described elsewhere [3].














Table 3
Age, Education, and Intelligence in the Psychosomatic and Psychoneurotic Groups

                             Age*            Education*       Intelligence*
Group                      M       SD        M       SD        M       SD
Psychoneurotic (N = 33)   31.33   9.63     11.06    2.58     108.4    9.35
Psychosomatic (N = 24)    32.46   9.99     10.44    3.38     107.0   11.65

*The differences between the means and variances of the groups were not significant.










21. FM+m:C’+c: Expressed in manner similar to variable 20.
22. W/M
23. A%
24. H: Excludes (H)
25. (H)
26. (H)+(A)
27. Anatomy: Includes charts, X-rays.
28. Oral: Number of responses pertaining to oral activity or
    emphasizing oral region.
29. Wheeler signs: Signs of male homosexuality as selected by Wheeler
    [4]. All twenty are retained here; protocol’s score depends on
    total number of signs present.
30. Wheeler’s signs 2, 4, 5, 6, 8, 11: Confusion in sexual identity is
    the rationale for these signs.
31. P: Beck’s list [1]
32. O
33. Number of questions the subject asks during the test.
34. Symmetry remarks
35. Cn, des, rem: Number of comments, descriptive remarks, or
    enumerations regarding color; generally not scored as color
    determinant.
36. Response total
37. Number of rejects
38. Mean initial response time, unrejected cards.
39. Mean deviation of initial response times.


The two groups were tested by chi square for their association with
these Rorschach variables. The Rorschach cutting points (tri- or
dichotomy) were utilized which had been employed for the parent series
of 90 cases; these, avoiding extreme categorizations, were set a
priori as closely as possible to the medians or, for the few variables
which were trichotomized, the points of tripartition. Rorschach
variables which significantly differentiated the PN from the PS group
(p ≤ .05) are presented in Table 4, together with their confidence
levels and coefficients of association. The mean scores for these
variables are included in Table 4.

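The procedure just described (dichotomize a Rorschach variable near its median, cross it with diagnostic group, test by chi square, and express the strength of association as a coefficient) can be sketched as follows. This is an editorial illustration only. The cell counts are HYPOTHETICAL, since the article does not report its contingency tables; the chi square is computed without continuity correction, which may differ from the author's practice; and the "correction for broad categories" is assumed to mean dividing the contingency coefficient C by its maximum attainable value for a k x k table, sqrt((k - 1)/k).

```python
from math import sqrt


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


def contingency_coefficient(chi2, n):
    """Contingency coefficient C = sqrt(chi2 / (n + chi2))."""
    return sqrt(chi2 / (n + chi2))


def corrected_coefficient(c, k):
    """ASSUMED correction: divide C by its maximum for a k x k table."""
    return c / sqrt((k - 1) / k)


# HYPOTHETICAL counts: rows are groups (PN, PS), columns are scores at or
# below vs. above the cutting point. Row totals match the group sizes 33 and 24.
pn_low, pn_high = 20, 13
ps_low, ps_high = 7, 17

n_total = pn_low + pn_high + ps_low + ps_high
chi2 = chi_square_2x2(pn_low, pn_high, ps_low, ps_high)
c_raw = contingency_coefficient(chi2, n_total)
c_corrected = corrected_coefficient(c_raw, 2)

print(round(chi2, 2), round(c_raw, 2), round(c_corrected, 2))
```

With one degree of freedom a chi square above 3.84 corresponds to p < .05, so a table like this hypothetical one would count as a differentiating variable.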
Discussion 


Three of the six Rorschach variables which 
differentiate between the two groups pertain 
to the utilization of color (Table 4). The use 
of and response to color in the Rorschach test 
is a main area of difference between the psy- 
chosomatic and the psychoneurotic patients of 
this study. The psychosomatic group uses sig- 
nificantly more color, color which is less con- 
trolled by the form element and which is com- 
paratively unbalanced by M in the Experience 
Balance. In terms of the usual interpretation 
of these variables (Rorschach, Klopfer, Beck), 
the PS group is more reactive emotionally, 
more labile, and more apt to experience affec- 
tive states in response to environmental stim- 
uli. The extratensive quality is more marked. 

The PS group is also more prone to give 
anatomical responses. This might perhaps be 
expected on the basis of the physiological dys- 
function which in fact is present and provides 
a realistic basis for concern as expressed 
through anatomical content. 

The PN group shows greater repression or 
suppression, more urgency toward withdrawal 
from the projective situation which calls for 
self-expression and the lowering of defensive 
barriers. The trend is manifested in a greater 
number of Rorschach rejections. The PS 
group, in contrast, is less prone to engage in 
repressive or suppressive withdrawal. This is 
consistent with its stronger emotional reactiv- 
ity and lability, as reflected in higher Sum C 
and weaker control over the color responses 
(previously discussed and shown in Table 4). 


Table 4

Rorschach Variables which Significantly Discriminate between the
Psychoneurotic and Psychosomatic Groups

Differentiating             p                           Direction of       Mean Score
Rorschach Variable          (Chi-square test)   C*      Association        PN       PS
 3. S+s                       .04               .40     Psychosomatic      1.64     2.00
10. Sum C                     .02               .4[?]   Psychosomatic      1.29     2.96
11. Uncontrolled color       <.01               .51     Psychosomatic      1.48     2.75
18. M − Sum C                 .03               .42     Psychoneurotic    +0.59    -1.31
27. Anatomy                   .02               .45     Psychosomatic      0.67     2.50
37. Rejects                   .02           [illegible] Psychoneurotic     1.58     0.54

*Coefficient of association with correction for broad categories [2, p. 363].
3×2 classifications are corrected as 3×3.




Two Rorschach variables, not included in 
Table 4, suggest discriminative trends between 
the PN and PS groups. These lie just beyond 
the limits of statistical significance (p = .06):
m and k. In each instance the association is
with PS. They lead to an hypothesis which 
warrants further examination, the hypothesis 
that the psychosomatic patients experience 
more inner tension than the psychoneurotic pa- 
tients and that this tension is not so effectively 
bound to specific defense mechanisms. 


Table 5

Psychoneurotic and Psychosomatic Groups Compared for Rorschach
Adjustment Index (RAI)

Group                       Mean      SD       t (Means)    F (SD)
Psychoneurotic (N = 33)    -1.47     3.76
                                                1.36         2.08
                                                (.18)*       (.05)*
Psychosomatic (N = 24)     -0.23     2.59

*p in parentheses.


Comparison of the Rorschach 
Adjustment Indices 


Lastly, the two groups were compared with 
respect to their general adjustment as meas- 
ured by a composite Rorschach Adjustment 
Index [3]. The results are given in Table 5. 
The mean adjustment score (or RAI) tends to
be lower for the PN than the PS, suggesting 
a less favorable level of adjustment, but the 
difference is not statistically significant. An F 
test between the variances of the Rorschach 
Adjustment Index for the PN and PS groups 
was significant at the .05 level of confidence 
(Table 5). We therefore reject the hypothe- 
sis that the PN and PS groups represent the 
same population; they come from populations 
which do differ with regard to Rorschach ad- 
justment level. The psychosomatic group con- 
tains more extreme types of patients, with 
greater severity of Rorschach maladjustment 
on the one hand and better Rorschach adjust- 
ment on the other. 
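
The t and F values in Table 5 can be approximately recovered from the tabled means and standard deviations. The sketch below is an editorial check, not the author's computation. It assumes that the printed SDs were calculated with N in the denominator, that the t test pooled the two variances, and that F is the ratio of the unbiased variance estimates; none of this is stated in the article.

```python
from math import sqrt


def unbiased_variance(sd, n):
    """Variance estimate with n - 1 in the denominator, from an SD computed with n."""
    return n * sd * sd / (n - 1)


def pooled_t(m1, sd1, n1, m2, sd2, n2):
    """Student t for two independent means, pooling the sums of squares."""
    ss1 = n1 * sd1 * sd1
    ss2 = n2 * sd2 * sd2
    pooled = (ss1 + ss2) / (n1 + n2 - 2)
    se_diff = sqrt(pooled * (1 / n1 + 1 / n2))
    return (m2 - m1) / se_diff


def variance_ratio(sd1, n1, sd2, n2):
    """F as the larger unbiased variance over the smaller."""
    v1 = unbiased_variance(sd1, n1)
    v2 = unbiased_variance(sd2, n2)
    return max(v1, v2) / min(v1, v2)


# Table 5: PN mean -1.47, SD 3.76, N 33; PS mean -0.23, SD 2.59, N 24.
t_value = pooled_t(-1.47, 3.76, 33, -0.23, 2.59, 24)
f_value = variance_ratio(3.76, 33, 2.59, 24)

print(round(t_value, 2), round(f_value, 2))
```

Under these assumptions t comes out near 1.37 and F near 2.08, close to the 1.36 and 2.08 reported. Note that the larger variance in this computation belongs to the PN group, whose tabled SD is 3.76 against 2.59 for the PS group.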


Summary and Conclusions 


Rorschach tests were administered to 33 patients with psychoneurotic
disorders (PN) and 24 patients with psychosomatic disturbances (PS).
The protocols were scored for 39 variables; each variable was then
tested by chi square for independence from the two diagnostic groups.

The two groups were found to be different 
in several respects. The PS used more color 
than did the PN and at the same time exer- 
cised a lesser degree of control over that color 
(Sum C, M—Sum C, and Uncontrolled 
color). The PN gave less anatomical content
and less often utilized white space in its
responses. It rejected cards more frequently.
In terms of the usual clinical interpretation 
of these variables, the PS is more reactive emo- 
tionally, more labile, and more apt to experi- 
ence affective states in response to environ- 
mental stimuli. It shows stronger oppositional 
trends which are frequently directed outward- 
ly as emotionally-toned stubbornness or nega- 
tivism, and displays greater concern over the 
body’s functioning. The PN group is more
avoidant in the test situation and tends to hold 
back from immediate emotional responsiveness. 

The two groups were also compared for 
Rorschach level of adjustment. The means 
were not significantly different, but an F test 
indicated (p = .05) that the samples come 
from populations which differ with regard to 
Rorschach adjustment: greater variability oc- 
curs in the PS group. 

These broad diagnostic groupings, psycho- 
somatic and psychoneurotic reactions, do differ 
in psychological characteristics as measured by 
objective analyses of Rorschach productions. 
This investigation stresses differences between 
the psychosomatic and psychoneurotic cate- 
gories. 


Received October 6, 1951. 


References 


1. Beck, S. J. Rorschach’s test: I. Basic processes. New York: Grune &
   Stratton, 1944.

2. Garrett, H. E. Statistics in psychology and education. New York:
   Longmans, Green, 1947.

3. Shatin, L. Relationships between the Rorschach test and the
   Thematic Apperception Test. Unpublished doctor’s dissertation.
   Harvard Univer., Feb., 1951.

4. Wheeler, W. M. An analysis of Rorschach indices of male
   homosexuality. J. proj. Techniques, 1949, 13, 97-126.

5. Veterans Administration Technical Bulletin TB 10A-78, October,
   1947. Nomenclature of psychiatric disorders and reactions.










Rorschach “Shock” for Two Special Populations 


Harold F. Uehling 


Psychiatric Field Service 


Department of Public Welfare, Madison, Wisconsin 


Sanderson [1] has presented norms for the 
reaction time of 50 adult clients of the Buffalo, 
N. Y., Jewish Community Service Society to 
the 10 Rorschach cards. The reaction times 
presented in his paper vary from 7.76 sec- 
onds for Card I to 15.56 seconds for Card 
IX. 

In order to test the generality of Sander- 
son’s objective study, we wish to present the 
comparative data of two additional popula- 
tions. Our “guard” population consisted of 50 
white males, 21 to 45 years of age, who were 
employed at the Wisconsin State Prison, and 
to each of whom the writer administered the 
Rorschach test as a condition of employment. 
The “inmate” population consisted of 50 
prison admissions upon whom the Rorschach 
test was completed by the same examiner. This 
population approximates that of the guard 
candidates in age range and intellectual ca- 
pacity, the main difference being a cultural 
one, in that the average “guard” had reached 
a somewhat higher level of scholastic attain- 
ment. 

It should be fairly obvious that both of 
these populations may be considered to be “‘re- 
luctant,” in comparison to Sanderson’s coun- 
seling group who might be considered rela- 
tively “eager” for testing, since as voluntary 
subjects they were not resistive to the testing 
situation. We might, for this reason alone, ex- 
pect longer reaction times for our two popu- 
lations than for Sanderson’s group. Besides 
the involuntary character of the testing situa- 
tions, both of our groups were naturally sub- 
ject to more than average tension in the cus- 
todial setting in which they were placed. 


Results 


Table 1 presents the mean values and sigmas for the reaction times to
the 10 Rorschach cards for the three populations under comparison. It
is quite evident that the differences between Sanderson’s group and
each of our two groups are highly significant. The smallest t ratio
found in any of the twenty comparisons between our data and those of
Sanderson exceeds the .001 level of chance, whereas none of the
differences between our two populations reaches the .05 level of
chance. It is concluded that our two samples are roughly similar in
respect to mean reaction times, and that both of them are very
significantly different from the reaction times presented by
Sanderson.

The rank-order correlations of mean reac- 
tion times for our three sets of data showed 
a fairly high degree of relationship, ranging 
from .75 to .82 in each of the three group com- 
parisons. In other words, the same cards tend 
to be comparatively harder and easier for all 
three populations. However, it is interesting 
to note that whereas Sanderson found Card 
IX had the longest reaction time, we found 
that Card VII was the hardest for both 
groups. 
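
The rank-order correlations can be rechecked from the rank-order (RO) columns of Table 1. The sketch below is an editorial illustration using the usual Spearman formula, rho = 1 - 6 * sum(d^2) / (n(n^2 - 1)), which the article does not name. Three ranks are read from the order of the printed means rather than from the RO column as it appears in this copy: Card IV for Sanderson is taken as 6, and Cards II and IX for the guard candidates as 4 and 8.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman rank-order correlation for two untied sets of ranks."""
    n = len(ranks_a)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Ranks of mean reaction time for Cards I through X (1 = shortest time).
sanderson = [1, 5, 2, 6, 3, 9, 8, 4, 10, 7]
guard = [2, 4, 5, 6, 1, 7, 10, 3, 8, 9]
inmate = [2, 5, 6, 3, 4, 8, 10, 1, 9, 7]

rho_sanderson_guard = spearman_rho(sanderson, guard)
rho_sanderson_inmate = spearman_rho(sanderson, inmate)
rho_guard_inmate = spearman_rho(guard, inmate)

print(round(rho_sanderson_guard, 2),
      round(rho_sanderson_inmate, 2),
      round(rho_guard_inmate, 2))
```

On these ranks the three coefficients round to .81, .75, and .82, in line with the range of .75 to .82 given in the text.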


Summary 


We have examined the reaction times to two sets of 50 individually
administered protocols of the Rorschach test, one for probationary
guards and the other for prison inmates, for the purpose of
determining whether the generalizations made by Sanderson hold for
these populations. The t ratios were so great as to suggest that there
would be less than one chance in a billion that the two sets of
results could have come from the same population. Both the guard
candidate and inmate groups were far apart from Sanderson’s 50
vocational counseling subjects. Our average reaction times for each of
the 10 cards varied from a minimum of 2.6 times to a maximum of 5.5
times as great as those represented in Sanderson’s population.
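
The size of these multiples can be recomputed from the means in Table 1. The sketch below is an editorial illustration; the dictionary and variable names are ours, and the means are those printed in the table.

```python
# Mean reaction times in seconds from Table 1, by card.
sanderson = {"I": 7.76, "II": 11.66, "III": 9.22, "IV": 13.02, "V": 9.66,
             "VI": 14.70, "VII": 14.26, "VIII": 11.46, "IX": 15.56, "X": 13.54}
guard = {"I": 37.1, "II": 39.1, "III": 39.5, "IV": 47.5, "V": 34.3,
         "VI": 53.0, "VII": 60.7, "VIII": 38.5, "IX": 60.1, "X": 60.6}
inmate = {"I": 34.1, "II": 48.9, "III": 50.7, "IV": 36.3, "V": 38.1,
          "VI": 58.1, "VII": 61.5, "VIII": 30.7, "IX": 60.1, "X": 52.3}

# Ratio of each prison-group mean to Sanderson's mean for the same card.
ratios = {}
for group_name, means in (("guard", guard), ("inmate", inmate)):
    for card, mean in means.items():
        ratios[(group_name, card)] = mean / sanderson[card]

smallest_key = min(ratios, key=ratios.get)
largest_key = max(ratios, key=ratios.get)

print(smallest_key, round(ratios[smallest_key], 2))
print(largest_key, round(ratios[largest_key], 2))
```

With the means as printed, the smallest multiple is about 2.7 (inmates, Card VIII) and the largest about 5.5 (inmates, Card III), close to the 2.6 and 5.5 quoted above.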

It is concluded that the reaction time differences of our two
populations are so great in absolute magnitude from those reported by
Sanderson that his norms would be of no value for this kind of
situation or with reference to this special population. It is
apparent, in special locations such as a prison, that generalizations
made on other populations may not hold, and that separate norms must
be obtained for the particular setting in which they are to be used.


Received August 8, 1951. 


References 


1. Sanderson, H. Norms for “shock” in the Ror- 
schach. J. consult. Psychol., 1951, 15, 127-129. 


Table 1

Average Reaction Time to the Rorschach Cards for Three Separate Populations

             Sanderson           “Guard” Candidate      “Inmate” Group
Card      RO    Mean     SD      RO    Mean    SD       RO    Mean    SD
I          1    7.76    2.0       2    37.1   34.7       2    34.1   34.1
II         5   11.66    3.3       4    39.1   33.5       5    48.9   41.6
III        2    9.22    2.4       5    39.5   36.9       6    50.7   46.9
IV         6   13.02    2.7       6    47.5   39.8       3    36.3   34.4
V          3    9.66*   2.7       1    34.3   35.8       4    38.1   37.0
VI         9   14.70    5.7       7    53.0   37.4       8    58.1   50.5
VII        8   14.26    3.7      10    60.7   41.4      10    61.5   47.3
VIII       4   11.46    3.7       3    38.5   29.1       1    30.7   18.5
IX        10   15.56    5.5       8    60.1   41.1       9    60.1   46.1
X          7   13.54    4.4       9    60.6   39.8       7    52.3   43.8

*This value, we are advised by Dr. Sanderson, was incorrectly published as 19.66.








A Note on The WISC and Other Tests of Children 
Six to Eight Years Old 


Bertram D. Cohen and Mary J. Collier 


Indiana University 


In the present paper are presented correla- 
tions obtained when the WISC [4], Revised 
Stanford Binet, Form L [3], and the Grace 
Arthur Point Scale, Revised Form II [1] 
were administered to a group of children rang- 
ing in age from six years, five months to eight 
years, nine months. 

Procedure. The subjects were drawn from 
the first and second years of a local elementary 
school in Bloomington, Indiana. There was no 
systematic method of selecting the children. 
Those included in this study were part of a 
larger continuous program of testing all chil- 
dren in these grades through the clinical train- 
ing program of the Indiana University Psy- 
chological Clinic. The mean age of the group 
studied was 7 years, 5 months with a standard 
deviation of 5 months. 


The Stanford-Binet, the Grace Arthur Point Scale, and the Wechsler
Intelligence Scale for Children were administered to each child on
three separate days, and in that order. The same examiner administered
all three tests to any one child. The examiners were graduate trainees
in clinical psychology rather than professional clinicians. All of the
student examiners, however, had previous training in clinical testing
and were closely supervised by the writers.


Table 1

Test Performance Characteristics of the Group

Test            N*    Mean IQ     SD      Range
S-B             53    104.8     15.07    58-133
WISC FS         51     99.8     14.26    65-128
WISC VS         51     98.5     14.48    66-129
WISC PS         51    101.1     14.51    76-125
G.A. Pt. Sc.    49     94.7     16.35    67-144

*A small proportion of our subjects could not be present for all three
tests because of illness or other extraneous factors. Thus there is
some variation in the numbers of cases indicated in our tables.


Results. Table 1 summarizes the test characteristics of the group
studied. Of special interest, in view of the correlation results
presented in Table 2, is the relatively high ceiling of the Point
Scale as compared with the WISC ceiling scores of our sample. The eta
index of association as well as the Pearson product-moment technique
was applied to all relationships in order to test the assumption of
linearity of regression which is required by the Pearson r. A test of
linearity suggested by Fisher [2, p. 237] was applied to each of the
relationships reported. The findings indicated that the WISC-Binet
relationships met the criterion of linearity. Hence, the coefficients
reported for this set of relationships are Pearson r’s. The r of .85
between the Binet IQ’s and those of the Full Scale WISC may be
considered as indicating a high degree of consistency insofar as it
approaches the magnitude of reliability coefficients reported for the
Binet by Terman and Merrill [3]. The correlations for each of the
subscales likewise indicate good consistency.


Table 2
Intertest Correlations

                    S-B           G.A. Pt. Sc.
                  (N = 51)         (N = 47)
WISC FS           .85 (r)         .80 (eta)
WISC VS           .82 (r)         .77 (eta)
WISC PS           .80 (r)         .81 (eta)
S-B (N = 49)                      .71 (eta)


The relationships between the WISC and the Grace Arthur Point Scale
were all found to be significantly nonlinear, at or beyond the 2 per
cent level of confidence. Consequently, we are reporting the eta index
of association for each of these relationships. The eta coefficients
of .80, .77, and .81 between the Grace Arthur Point Scale and the WISC
Full Scale, Verbal Scale, and Performance Scale respectively, are all
indicative of definite consistency.
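
The logic of comparing eta with r can be sketched in code. The example below is an editorial illustration, not the authors' computation. It uses a commonly given form of the linearity test, F = [(eta^2 - r^2)/(k - 2)] / [(1 - eta^2)/(N - k)], where k is the number of classes of the predictor; the article does not report k or its class intervals, and the two small data sets are HYPOTHETICAL.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def eta_squared(xs, ys):
    """Correlation ratio of y on x; each distinct x value is one class.

    Returns (eta squared, number of classes)."""
    n = len(ys)
    grand_mean = sum(ys) / n
    classes = {}
    for x, y in zip(xs, ys):
        classes.setdefault(x, []).append(y)
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2
                     for g in classes.values())
    ss_total = sum((y - grand_mean) ** 2 for y in ys)
    return ss_between / ss_total, len(classes)


def linearity_f(xs, ys):
    """Return (r, eta, F) where F tests departure from linear regression."""
    r = pearson_r(xs, ys)
    eta2, k = eta_squared(xs, ys)
    n = len(ys)
    f = ((eta2 - r * r) / (k - 2)) / ((1 - eta2) / (n - k))
    return r, sqrt(eta2), f


# HYPOTHETICAL data: class means rise in a straight line with x.
linear_x = [1, 1, 2, 2, 3, 3]
linear_y = [1, 3, 2, 4, 3, 5]

# HYPOTHETICAL data: class means level off at the top (a ceiling effect).
curved_x = [1, 1, 1, 2, 2, 2, 3, 3, 3, 4, 4, 4]
curved_y = [1, 2, 3, 4, 5, 6, 6, 7, 8, 6, 7, 8]

r_lin, eta_lin, f_lin = linearity_f(linear_x, linear_y)
r_cur, eta_cur, f_cur = linearity_f(curved_x, curved_y)

print(round(r_lin, 3), round(eta_lin, 3), round(f_lin, 3))
print(round(r_cur, 3), round(eta_cur, 3), round(f_cur, 3))
```

When the class means fall on a straight line, eta equals r and F is zero; when one test has a lower ceiling than the other, eta exceeds r and F grows. In the latter case r understates the association, and eta is the more suitable index.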

The form of the curvilinear relationships 
between the WISC scales and the Grace Ar- 
thur indicated that the Grace Arthur has more 
“top” than the WISC scales. That is, as noted 
previously, the Grace Arthur has a higher ceil- 
ing than the WISC and tended to spread the 
children more effectively at the upper levels 
of intelligence. 

The relationship between the Binet and the Grace Arthur Point Scale
was also evaluated, since this information was available and might
serve to throw some further light on the obtained results concerning
the WISC. The eta index of association was employed here, too, since a
trend towards nonlinearity was indicated by the Fisher test; P was
.14. In this instance, r was .53. The eta coefficient of .71 is
somewhat lower than those obtained between the WISC and the Grace
Arthur Point Scale. That a lesser degree of association exists between
the Grace Arthur and the Binet, as compared with the Grace Arthur and
the WISC, might have been expected since the Binet is a predominantly
verbal test while the Grace Arthur is a performance scale. The WISC,
on the other hand, includes a performance scale.


Received June 30, 1951. 


References 


1. Arthur, Grace. A point scale of performance tests, Revised Form II.
   New York: Psychological Corp., 1947.

2. Guilford, J. P. Fundamental statistics in psychology and education.
   New York: McGraw-Hill, 1942.

3. Terman, L. M., & Merrill, Maud A. Measuring intelligence. Boston:
   Houghton Mifflin, 1937.

4. Wechsler, D. Manual for the Wechsler Intelligence Scale for
   Children. New York: Psychological Corp., 1950.










A Comparison of the WISC and the 
Wechsler-Bellevue 


Lois Delattre and David Cole¹


Occidental College 


The widespread clinical use of the Wechsler- 
Bellevue made it inevitable that, upon the in- 
troduction of the WISC, many psychologists 
would be tempted to use the clues they had 
found important on the adult test in interpre- 
tation of the children’s form. 


Despite Wechsler’s warning in the WISC 
manual [2] that it is not valid to assume that 
the various subtests have equivalent signifi- 
cance on the two scales, the obvious similarities 
in the form of many items offer a strong
temptation to make the assumption that the 
two tests are measuring the same functions at 
all points, for both children and adults. 


The purpose of the present study was to 
compare the performance of 50 subjects on the 
Wechsler-Bellevue Form I [1] and on the 
WISC, with the aim of investigating the ex- 
tent to which similar results, both in IQ 
score and profile, might be derived from the 
two tests. 


The subjects were 50 children attending the public schools in the
vicinity of the college. They were largely from an upper-middle-class
socioeconomic level, and were slightly above the average in
intelligence. Because the Wechsler-Bellevue and WISC overlap for only
a limited age group, it must be recognized that this study compares
the two tests for that age group only, and for an above-average
intelligence range within that group. Our subjects ranged in age from
10 years, 5 months, to 15 years, 7 months. Thirty-four were boys and
16 were girls. The range of IQ’s obtained was from 43 to 132 on the
Wechsler full scale.

¹ The authors wish to express their appreciation to the following
persons who participated in the administration of the tests used in
the present study: Gordon Footman, Francis Franciscus, William Miles,
Nedra Newell, Betty Taplin, Virgil Watters, Laurice White, Ilah
Wilstach, Allan Woodhull, and Ray Wright. Special appreciation is due
Rosemary Lane, who scored each of the one hundred tests.

Several graduate students participated in 
the administration of the tests. All students 
had equal training in test administration, and 
all tests were scored by the same person. Each 
of the 50 subjects had both tests administered 
by the same examiner. A methodological lim- 
itation arises from the fact that in 48 cases, the 
Wechsler was administered first, thus con- 
sistently giving the WISC the advantage of 
practice effects. This limitation was necessary 
because of the need to integrate the present 
study with other work, and may well account 
for the consistently higher IQ’s obtained on 
the WISC. 


Results 


The mean full scale IQ for our group on 
the Wechsler was 106, contrasted with 112 
on the WISC. Performance IQ was consis- 
tently higher than verbal on both tests, which 
is apparently typical for the age group and 
socioeconomic level tested. The mean Wech- 
sler verbal IQ was 102, compared to 108 on 
the WISC. The mean Wechsler performance 
IQ was 109, compared to 114 on the WISC. 
In 70 per cent of our cases, if the performance 
IQ exceeded the verbal IQ on the Wechsler, 
it also exceeded it on the WISC. The verbal 
IQ scores obtained correlate .86, performance 
IQ’s correlate .82, and the full scale IQ’s, .87. 
These figures approximate the reliability co- 
efficients reported for the WISC. 


In all cases positive correlations were found between weighted scores
for similar subtests for the two instruments. These are cited in
column 1 of Table 1. Among these figures, the low correlation for
picture arrangement is noticeable, and suggests that the same factors
are not measured by this subtest on the two instruments. The
correlation on many of the subtests may be lowered because of
differences in scoring methods on the two tests. This may be
particularly relevant in the case of vocabulary.


Table 1
Subtest Comparisons of WISC and Wechsler

[Column headings are illegible in the source; the descriptions below
follow the text's account of columns 1, 2, and 3.]

                    (1)               (2)                     (3)
                    Correlation of    Per cent of cases       Per cent of cases
                    weighted          with same relation      with same relation
Subtest             scores            to verbal or            to vocabulary
                                      performance mean
Information           .53               76                      38
Comprehension         .48               48                      74
Arithmetic            .54               66                      48
Digits                .7[?]             68                      66
Similarities          .68               70                      60
Vocabulary            .55               62                      --
Picture Arr.          .19               60                      68
Picture Comp.         .69               72                      66
Block Design          .49               54                      72
Object Assembly       .56               36                      58
Digit Symbol          .65               56                      56





One technique for deriving clinical material 
from the Wechsler profile is to compare the 
position of a verbal subtest with the mean 
weighted score for the verbal tests, and a per- 
formance score with the performance scale 
mean. Our data were analyzed to determine 
the extent to which such a relation occurring 
on a Wechsler profile was likely to be repeated 
on the WISC profile. The results are cited 
in column 2 of Table 1. It appears that for 
group data the relationship of a subtest to the 
verbal or performance mean will most often 
be the same on both tests. For individual cases, 
gross discrepancies may occur. 
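
The tally behind column 2 can be sketched as follows. This is an editorial illustration of the counting rule as we read it, namely that a case agrees when a subtest stands on the same side of its scale mean on both tests. The profiles are HYPOTHETICAL, the function and key names are ours, and the treatment of a score exactly at the mean as its own category is an assumption.

```python
def relation(score, mean):
    """Return -1 if the score is below the mean, 0 if equal, +1 if above."""
    return (score > mean) - (score < mean)


def percent_same_relation(profiles_a, profiles_b, subtest):
    """Per cent of cases in which `subtest` stands in the same relation
    to the scale mean on both tests.

    Each profile is a dict of weighted scores for the subtests of one scale.
    """
    same = 0
    for a, b in zip(profiles_a, profiles_b):
        mean_a = sum(a.values()) / len(a)
        mean_b = sum(b.values()) / len(b)
        if relation(a[subtest], mean_a) == relation(b[subtest], mean_b):
            same += 1
    return 100.0 * same / len(profiles_a)


# HYPOTHETICAL weighted scores for four children on three verbal subtests
# (I = information, C = comprehension, A = arithmetic).
wechsler = [
    {"I": 12, "C": 10, "A": 8},
    {"I": 9, "C": 11, "A": 10},
    {"I": 13, "C": 9, "A": 11},
    {"I": 7, "C": 10, "A": 13},
]
wisc = [
    {"I": 11, "C": 12, "A": 10},
    {"I": 10, "C": 9, "A": 12},
    {"I": 14, "C": 10, "A": 9},
    {"I": 8, "C": 9, "A": 13},
]

pct_information = percent_same_relation(wechsler, wisc, "I")
print(pct_information)
```

In this made-up sample, information stands in the same relation to the verbal mean for three of the four children, or 75 per cent. A group figure of that size still leaves room for reversals in individual cases, which is the point made in the text.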

Special consideration of those cases where 
the relationship of a given subtest to the 
appropriate mean is reversed on the two in- 
struments affords a technique for evaluating 
the relative difficulty of the subtests on the 
two scales. From such consideration it appears 
that information, comprehension, picture ar- 
rangement, and object assembly are easier on 
the adult Wechsler, while arithmetic, vocabulary,
and digit symbol appear easier on the WISC.


Another approach to profile analysis is to 
consider the relation of the subtests to the 
weighted score for vocabulary. Again the data 
have been analyzed to find the extent to which 
such relationships occurring on the Wechsler 
will also appear on the WISC. The results 
are indicated in column 3 of Table 1. As with 
the case for verbal and performance mean, 
these show general similarity for group trends. 

Lastly, by the method of paired comparisons,
the relationships between any two subtests
on the two scales were studied. This
step was taken in order to check consistency
of the “sign” approach, as exemplified by
Wechsler’s contention for Form I that a low
score on similarities in the presence of high
vocabulary and information scores is indicative
of schizophrenia. The results of the analysis
are presented in Table 2. Again, the same
general conclusion may be drawn, that for
group trends the same relationships appear on
both tests, but not to such a degree as to afford
valuable prediction in individual cases.


Table 2
Percentage of Cases Showing Same Relationships
Between Paired Subtests on both WISC
and Wechsler

Subtests         Subtests         Subtests
Compared    %    Compared    %    Compared    %
I-C        52    A-D        52    V-PA       68
I-A        56    A-S        62    V-PC       66
I-D        62    A-V        48    V-BD       72
I-S        62    A-PA       58    V-OA       58
I-V        38    A-PC       52    V-DSy      56
I-PA       62    A-BD       68
I-PC       66    A-OA       38    PA-PC      48
I-BD       70    A-DSy      70    PA-BD      #4
I-OA       62                     PA-OA      46
I-DSy      62    D-S        62    PA-DSy     48
                 D-V        66    PC-BD      50
C-A        56    D-PA       62    PC-OA      54
C-D        52    D-PC       64    PC-DSy     68
C-S        40    D-BD       60
C-V        74    D-OA       46    BD-OA      38
C-PA       48    D-DSy      66    BD-DSy     66
C-PC       56
C-BD       56    S-V        60    OA-DSy     48
C-OA       38    S-PA       48
C-DSy      50    S-PC       64
                 S-BD       50
                 S-OA       44
                 S-DSy      56
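
The paired-comparison tally can be sketched as follows, under stated assumptions: each profile is a mapping from subtest abbreviation to weighted score, ties are treated as a third kind of relation, and all names and scores are invented for illustration. For every pair of subtests, the sketch reports the percentage of subjects whose ordering of the two subtests is the same on both tests.

```python
from itertools import combinations

def relation(x, y):
    """Return 1 if x > y, -1 if x < y, and 0 for a tie."""
    return (x > y) - (x < y)

def paired_agreement(cases, subtests):
    """Map each subtest pair to the percentage of cases with the same relation on both tests."""
    result = {}
    for s, t in combinations(subtests, 2):
        same = sum(relation(w[s], w[t]) == relation(c[s], c[t]) for w, c in cases)
        result[(s, t)] = 100.0 * same / len(cases)
    return result

# Invented weighted scores for two subjects: (Wechsler profile, WISC profile).
cases = [
    ({"I": 12, "C": 10, "S": 8}, {"I": 11, "C": 9, "S": 10}),
    ({"I": 7, "C": 10, "S": 13}, {"I": 12, "C": 9, "S": 9}),
]

table = paired_agreement(cases, ["I", "C", "S"])
print(table)  # {('I', 'C'): 50.0, ('I', 'S'): 50.0, ('C', 'S'): 0.0}
```

With eleven subtests this procedure yields 55 pairs, which is the number of pairs a complete paired-comparison table for the eleven subtests contains.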


Summary and Conclusions 


In this study 50 young persons were admin- 
istered both Wechsler-Bellevue Form I and 
the WISC. Comparisons were made of their 
performances on the two tests. Slightly higher 
IQ’s were obtained on the WISC, possibly 
because of the practice effects from the Wechs- 
ler. A high correlation was found between the 
scores obtained on the two tests. Comparison 
of subtest performances on the two instru- 
ments showed correlations for most subtests in 
the area of .55, reaching a high of .71 for 
digits, and a low of .19 for picture arrange- 
ment. The latter figure is sufficiently low to 
suggest that this item measures something 
different on the two tests. There is some evi- 
dence that information, comprehension, pic- 
ture arrangement, and object assembly are 
easier on the Wechsler, while arithmetic, vocabulary,
and digit symbol are easier on the WISC.




Comparisons of the profiles by scale means,
vocabulary level, and relationships between different
pairs of weighted scores all suggest that for
group trends similar relationships will appear
on both scales, but that such similarity is not
large enough to have predictive value in the
individual case.

From the above the conclusion may be 
drawn that the WISC provides an instrument 
which will measure IQ very comparably to 
the Wechsler. The “clinical sign” approach 
used on Wechsler profiles has limited appli- 
cation to the WISC when working with in- 
dividual records. 


Received July 25, 1951. 


References 


1. Wechsler, D. The measurement of adult intelli- 
gence. (3rd Ed.) Baltimore: Williams & Wil- 
kins, 1944. 

2. Wechsler, D. Wechsler Intelligence Scale for 
Children. New York: Psychological Corp., 1949. 




A Note on the Validity of the Wechsler Intelligence 
Scale for Children 


Grace Thompson Altus 


Santa Barbara County Schools 


The wide acceptance of the Wechsler-Belle- 
vue Intelligence Scale has undoubtedly given 
impetus to the use of the relatively new Wech- 
sler Intelligence Scale for Children. This note 
presents a minor contribution in the direction 
of validation of the latter with data on the 
relationship between WISC scores and two 
widely-used group tests, one of intelligence and 
one of reading. Although this reverses the cus- 
tomary procedure of validating group tests 
against individual tests, the community of 
function among intelligence and achievement 
batteries is sufficiently marked so that one
should legitimately expect rather high correla- 
tions in these areas if the individual test is to 
be used to advantage in school guidance. 


A sample of 55 students from Lompoc Jun- 
ior High School in Santa Barbara County was 
selected so as to represent the total student 
body in terms of age, sex, proportion in each 
grade, proportion of bilinguals, and IQ de- 
rived from the California Test of Mental 
Maturity (CTMM). As the average age was
13 years, 7 months, the mean IQ 100 with a 
standard deviation of 16 points, and the ratio 
of boys to girls almost exactly equal, the sam- 
ple appears to be typical of similar grade en- 
rollments. Reading scores also showed normal 
grade placement and scatter. 


All subjects were given the WISC by the 
writer. Scores were already available from the 
CTMM and the Progressive Reading Test in 
consequence of an all-school testing program 
in October of 1950. Correlation of the WISC
Full Scale IQ vs. the CTMM Total IQ
yielded an r of .81, which is perhaps typical of 
the intercorrelation of accepted intellectual 
measures and very close to the correlation 
claimed by the authors of the CTMM for its 
relationship with the Stanford-Binet. The 
scattergram showed some curvilinearity, with 
group test scores over 119 relatively undiffer- 
entiated by the individual test. 

Mental ages were then computed from the 
Verbal Scale of the WISC as of the date on 
which reading tests were given and converted 
into grade levels. Correlation of these WISC 
expectancy grades with actual reading grade 
placements gave a Pearsonian Coefficient of 
.84, which was reduced only slightly (to .83) 
by ruling out chronological age by means of 
partial correlation. It is of interest to note that 
this figure, representing the relationship be- 
tween an orally-administered verbal intelli- 
gence test and reading scores, was even higher 
than a coefficient of .74 yielded by a correla- 
tion of grade level expectancies derived from 
CTMM mental ages against the same cri- 
terion of reading scores, despite the fact that 
reading was common to both of the latter tests. 
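
The step of "ruling out chronological age by means of partial correlation" follows the standard first-order partial correlation formula. The sketch below is illustrative only: the note does not report the correlations of either measure with age, so the age values used here are hypothetical placeholders, and the function name is an assumption.

```python
import math

def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return (r_xy - r_xz * r_yz) / denominator

# r_xy = .84 is the reported correlation between expectancy grade and reading grade.
# The two correlations with chronological age (.30 each) are invented for illustration.
r = partial_correlation(0.84, 0.30, 0.30)
print(round(r, 3))
```

When both measures are uncorrelated with age the formula returns the original coefficient unchanged, which is why a small reduction, such as the one reported here, indicates that age contributed little to the relationship.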

In summary, then, it would appear that the 
relationships found within one small but rep- 
resentative sample of junior high-school stu- 
dents, between the WISC and group test 
scores of reading and intelligence, are suffici- 
ently high to indicate that the WISC prob- 
ably has considerable validity in comparable 
school settings. 


Received July 10, 1951. 










NEW BOOKS AND TESTS

Books 
Abramson, Harold A. (Ed.) Problems of conscious- 
ness. Proceedings of the Second Conference, 
March 19-20, 1951. New York: Josiah Macy, Jr. 
Foundation, 1951. Pp. 178. $3.25. 


The proceedings of the conference on consciousness
consist of three major papers and a verbatim
transcript of their discussion by twenty-one participants.
Rapaport’s paper deals mainly with psychodynamic
theory illustrated by a case of amnesia;
Zilboorg’s is chiefly a discussion of the preceding
paper; and Wright’s is a presentation of a case of
schizophrenia. The papers are hard to follow because
of the participants’ numerous and often doubtfully
relevant interruptions, some of which descend
to sheer heckling. It is evident that the discussants
enjoyed their free give-and-take thoroughly, and the
faithful transcript gives the reader a sense of being
present. The report reveals little new about the
problem of consciousness. It does provide rich
source material about the personalities of a group
of psychiatrists, psychologists, and other social
scientists.—L. F. S.

Backus, Ollie, & Beasley, Jane. Speech therapy with
children. Boston: Houghton Mifflin, 1951. Pp.
viii + 440. $3.00.


Although written primarily as a manual for
teachers of speech, this text, even its title, reflects
a changed conception of the functions of such
teachers. Confronted with the inadequacy of
speech correction programs based largely on drills
and exercises, the authors enthusiastically adopt the
orientation that the most important aspect of speech
correction is the therapeutic relationship between
the teacher and the pupils, that speech teaching is
most effectively handled as a special case of group
therapy with special attention to the needs of individuals,
and that speech skills should be treated
as a means of facilitating social interaction. Throughout
the book the teacher is referred to as “the therapist.”
Part I attempts within 70 pages to present
the theoretical basis of this new approach to speech
instruction and to delineate the implication for
classroom practice. Part II (350 pages) presents
“Illustrative Lessons in Group Therapy” at the Pre-school,
Primary, and Intermediate levels. The final
chapter is concerned with adapting group lessons to
individual work.—E. L. K.


Note: The reviews were prepared by the Editor
and Associate Editors, who may be identified by
their initials.

Bowlby, John. Maternal care and mental health.
Geneva: World Health Organization, and New
York: Columbia University Press, 1951. Pp. 179.
$2.00.


This is “a report prepared on behalf of the
World Health Organization as a contribution to the
United Nations program for the welfare of homeless
children” and constitutes No. 2 in the Monograph
series of WHO. To obtain background for
the report, Dr. Bowlby visited several European
countries as well as the United States. In each he
discussed the problem with local workers, observed
local practices, and familiarized himself with the
pertinent research findings. Part I summarizes the
evidence regarding the adverse effects of maternal
deprivation. And although the evidence is not as
complete as might be wished, the author believes
that it forces the conclusion: “prolonged deprivation
of the young child of maternal care may have
grave and far reaching effects on his character and
so on the whole of his future life.” For this
reason, he says “deprived children, whether in their
own homes or out, are a source of social infection
as real and serious as are carriers of diphtheria and
typhoid.” Part II includes a discussion of the
causes of maternal deprivation in western society
and an evaluation of a number of approaches to the
prevention of such deprivation. This is an important
and valuable document. It contains tables
based on data not available in tabular form in
many of the original research articles and chi-square
tests of significance where these were not reported
by the researchers. The bibliography includes 159
references. Finally, the monograph is very well
written.—E. L. K.

Brody, Eugene B., & Redlich, Frederick C. (Eds.)
Psychotherapy with schizophrenics: a symposium.
New York: International Universities Press, 1952.
Pp. 246. $4.00.


The symposium on psychotherapy with schizo- 
phrenic patients grew out of a conference held in 
the Department of Psychiatry of the Yale Universi- 
ty Medical School in December, 1950. Three papers 
presented at that conference, and the discussions 
of these papers, constitute chapters in this volume; 
two papers presented elsewhere make up other
chapters. Rounding out the volume are the two
opening chapters presumably written for this pub- 
lication by the editors: reviews of literature on the 
concept of schizophrenia and on the treatment of 
schizophrenia. These two chapters and their bibli- 
ographies are extremely important for psychologists. 
The confused state of knowledge about schizo- 
phrenia is clearly indicated throughout. Robert P. 
Knight’s introduction begins by saying, “Schizo- 
phrenia is undoubtedly the major clinical and 
theoretical problem confronting psychiatry today.” 
Lawrence Kubie closes the volume by summarizing 
the Yale Conference with, “We have neither clarified
what we consider to be the essence of the schiz-
ophrenic process, nor have we taken any theory as 
our working basis.” The contributors to the volume 
are about as eminent as any in this field, so that 
their contributions, though far from definitive, are 
the best available today.—M. K. 


Greenacre, Phyllis. Trauma, growth, and person- 
ality. New York: Norton, 1952. Pp. xii + 328. 
$4.50. 


A collection of fourteen psychoanalytic papers, 
thirteen of which were published in journals or 
yearbooks between 1941 and 1951. As in many psy- 
choanalytic studies, the virtues lie in sensitivity and 
freedom of interpretation, the faults in a failure 
to examine hypotheses critically. Most of the 
chapters are centered on individual case studies 
which emphasize the part played by real childhood 
trauma in the production of neuroses.—L. F. S. 


Hinckley, Robert G., & Hermann, Lydia. Group 
treatment in psychotherapy. Minneapolis: Univer. 
of Minnesota Press, 1951. Pp. x + 136. $3.00. 


This book stems from the authors’ experience 
with group therapy in the Students’ Mental Hy- 
giene Clinic at the University of Minnesota during 
the last decade. The authors’ avowed purpose is 
“to illuminate and point up the processes involved 
in group psychotherapy, to increase familiarity with 
them, and to show by example how they work.” 
They are unusually successful in achieving these 
aims. The book is organized clearly and well. The 
various aspects of group therapy are analyzed, 
treated in thorough and illuminating fashion, and 
illustrated concretely by well-chosen excerpts from 
group sessions. The writing is simple, clear, and 
stimulating. The physical format is excellent. In- 
tellectually and aesthetically it is a pleasure to 
read.—W. A. H. 


Loewy, Herta. The retarded child. New York: 
Philosophical Library, 1951. Pp. 160. $3.75. 


This book carries the title-page subtitle “A 
Guide for Parents and Teachers,” to which the 
jacket adds “and Social Workers.” Actually, the 
content is directed toward educational and training 
procedures with almost no (other) emphasis on
characteristics or social management. And although
in his Foreword Professor Max Neuburger stresses 
the author’s devotion to “the psychology and treat- 
ment fit for just this child,” one misses precisely 
these orientations in the text. A major virtue is 
found in Miss Loewy’s belief in the handicapped 
child and his potentialities, however limited the 
latter may be. The exposition of educational pro- 
cedures, while beyond reproach, does not impress 
this reviewer as better than, or even equal to, that 
of many other recent publications. One feels that 
while offering many excellent teaching suggestions, 
this book affords little or no other help or comfort 
for parents or professional workers. Nor does it at- 
tempt to review or incorporate the major historical 
or current contributions of what is now a rather 
fruitful area.—E. A. D. 


Piéron, Henri, Pichot, Pierre, Faverge, J.-M., &
Stoetzel, Jean. Traité de psychologie appliquée.
Livre II. Méthodologie psychotechnique. Paris:
Presses Univer. de France, 1952. Pp. viii + 217.
1000 fr.


The second volume of this handbook of applied
psychology deals with the fundamentals of psychological
measurement. Of its four chapters,
Piéron’s scholarly treatment of experimental methods
applied to psychometrics is clearly the best. He
gives greatest emphasis to general methodology, but
includes a concise, comprehensive survey of objective
tests. Most of the tests are well known to
American psychologists; a number of the less
familiar ones are interesting and seem worthy of
further investigation. Pichot’s chapter on personality
measurement achieves completeness at the expense
of saying a very little about everything, witness
the one page, with seven references, devoted
to the Rorschach. Faverge gives an admirably
compact survey of statistical methods from histograms
to factor analysis. In the concluding chapter,
Stoetzel offers a good miniature text on opinion
polling. It is notable that all but 14 of the approximately
250 references he cites are from American
sources. In quality and scope, the Traité compares
favorably with American handbooks. The remaining
five volumes will be awaited with interest.—L. F. S.


Schwebel, Milton, & Harris, Ella Freas. Health 
counseling. New York: Chartwell House, 1951. 
Pp. ix + 238. $3.00. 


The unfamiliar term health counseling is defined 
as counseling which deals with problems of adjust- 
ment involving illness, physical defect, or miscon- 
ceptions about physiological processes. Except that 
the illustrations are drawn mainly from problems 
encountered by physical education teachers, school 
nurses, and other workers in health areas, the 
presentation does not differ greatly from that appropriate
to other fields of counseling activity. The
book’s main theses are that real counseling, not
merely “advice,” is needed by youngsters with 
health problems, and that educational workers can 
help by having counseling attitudes and a modicum 
of counseling skill. The relationships with medicine 
are stressed appropriately.—L. F. S. 


Smith, F. V. The explanation of human behavior. 
London: Constable, and New York: Macmillan, 
1951. Pp. ix + 276. $2.75. 


A British book which presents a thoughtful con- 
sideration of the nature of the explaining process 
and the logic of theory construction. To illustrate 
the structure of theories, the positions of McDougall, 
Lewin, Allport, Watson, Hull, and Tolman are 
presented in some detail and are reviewed critical- 
ly. The volume can serve as a valuable adjunct to 
courses in psychological theories and systems.— 
L. F. S.


Vaughan, Wayland F. Personal and social adjustment.
New York: Odyssey Press, 1952. Pp. xiv +
578. $4.25.


A beautiful job of book publishing has been 
wasted on a hodgepodge collection that defies 
description. Important researches, poems, popular 
magazine articles, hymns, cartoons, and testimonials 
are treated with equal weight. Sweeping and un- 
founded generalizations abound throughout; easy 
success formulae are made readily available. A few 
quotations may give a general idea; there are
hundreds like them: “Schizophrenia may be considered
as the end result of faulty personality
development. Bad habits begin early and persist
late. . . .” “Eccentricity causes shyness, and seclusiveness
exaggerates the peculiarity. . . .” “A person
with highly developed social concern is apt to think
twice before he ‘steps out’ on his wife, especially if
he is going to become involved with another man’s
wife. . . .” “Psychologists recommend the development
of frustration-tolerance as one means by which
a person can maintain his mental balance and still
face reality. . . .” “If a medical examination reveals
no physical basis for that tired feeling, the
source will probably be found in an unhappy
marriage. . . .” “Women have plenty of grounds for
hating men in general and some one man in particular.”
The temptation is to go on and on with
quotable quotes, but what’s the use?—M. K.


Willey, Roy DeVerl. Guidance in elementary edu- 
cation. New York: Harper, 1952. Pp. xiii + 
825. $5.00. 


The author has assembled in a well-planned text- 
book the recent findings of psychiatry, psychology, 
and the mental hygiene movement in the field of 
elementary education. Since the personality development
of a child in the elementary school is still immature,
guidance is most frequently found in a
controlled environment with the teacher playing the
leading role. While the author reserves a place for 
the specialist, his emphasis is on the responsibility 
and contribution of the teacher. The book is divided 
into three sections: (1) “Basic Understandings.” 
The author shows how children develop. The 
chapters on childhood emotions and their develop- 
ment and needs are especially valuable. (2) “Tech- 
niques of Understanding Children.” The author 
is not trying to make specialists in the use of tests, 
but hopes to train teachers in the appreciation of 
their use. (3) “Techniques of Guidance.” In this 
section the teachers are helped to apply the knowl- 
edge to their own group. Throughout the book 
there are useful and practical illustrations drawn 
from school situations. Because of the recency of 
our knowledge, many of our present teachers have 
had little or no training in this area. The book 
should prove helpful in training potential teachers, 
but even more valuable for the in-service training 
of teachers who must meet the new demands of 
modern education.—B. M. L. 


Young, Pauline V. Social treatment in probation 
and delinquency. (2nd Ed.) New York: McGraw-Hill,
1952. Pp. xxvi + 536. $7.00.


This book is well characterized by its subtitle, 
“Treatise and Casebook for Court Workers, Pro- 
bation Officers and Other Child Welfare Workers.” 
Nearly three-quarters of the first (1937) edition 
have been rewritten in order to present a correct 
picture of current practices. All clinical psychol- 
ogists should find the book useful; the coverage is 
broad and the style of writing almost compelling. 
The use of carefully selected case materials adds 
greatly to the effectiveness of presentation. Of 
especial value to students of psychology are Parts II 
and IV devoted to “Legal Aspects of Probation,” 
and “Utilization of Community Resources.” Al- 
though no amount of reading represents an adequate 
substitute for experience, this volume should go a 
long way in providing the young clinical psychol- 
ogist with appropriate expectations as to the variety 
and complexity of the problems of juvenile delin- 
quency; it should also give him a greater apprecia- 
tion of the need for interprofessional teamwork in 
approaching these problems.—E. L. K. 


Tests 


Friedich-Delys. Test de coup d’oeil technique, 
Planchette F D Y. Brussels, Belgium: Ets Bet- 
tendorff, S. A., (1951). Pp. 8. 


The Planchette F D Y is an industrial formboard 
test. Eighteen irregular shapes are to be transferred 
from the “trial panel” to exactly fill nine recesses 
in the “test panel,” three of which are filled by the 
insertion of one piece, three by two pieces, and 
three by three pieces. It is entirely a speed test, 
the time in seconds being converted to a point score
for which percentile norms are provided. The test
was standardized on 1831 “young persons twenty 
years old” in Belgium, and is alleged to be useful 
for predicting success in mechanical trades. No 
reliability figures are given, and no evidence of 
validity aside from the doubtfully relevant correla- 
tion of .30 with the Raven matrices. In view of 
the availability of better-standardized tests of 
similar intention, this formboard does not seem to
be a particularly significant contribution.—L. F. S.


Rey, André. Six épreuves au service de la psychol- 
ogie clinique. Brussels, Belgium: Ets Bettendorff, 
S. A., (1951). Pp. 40. 


The performance tests devised by Professor Rey 
of Geneva, Switzerland, have received passing 
mention in a few American publications, but no 
comprehensive description of them has yet appeared 
in the English language. The present pamphlet, in 
French, is a manual for six performance tests, the 
materials of which are obtainable from a Belgian 
source. The tests are novel, and have no near 
counterparts in the materials familiar to American 
psychology. In free translation, they are described
as “perceiving the continuity of direction of broken 
lines,” “memory for serial positions,” “choice based 
on instructions to include and exclude,” “choosing 
and making tools,” “measuring length indirectly,” 
and “unrolling figures.” Normative data indicate 
that the tests are applicable from the four-year to 
the adult level, but not all of them cover the entire
range. Four of the six may be administered to
five-year-olds, and four to adults. Percentile norms, by
ages, are given separately for each test; they are 
not combined into a scale. In comparison with 
American standards, the quantitative information 
is meager. Data on reliability and on correlations 
with other tests or criteria are entirely absent. Even 
the number of cases on which the norms are based 
is stated explicitly in only one instance. In spite 
of such shortcomings, the tests seem to have qualities
of originality and of clinical flavor that commend
them to experimental trial. It may be predicted
that more will be heard of them.—L. F. S.


Books Received 


Buhler, Charlotte; Smitter, Faith; Richardson, 
Sybil; & Bradshaw, Franklyn. Childhood prob- 
lems and the teacher. New York: Holt, 1952. 
Pp. xii + 372. $3.75. 

Gregg, Fred M. The people's psychology. New
York: Vantage Press, 1951. Pp. 493. $5.00.

MacDougall, Curtis D. Understanding public
opinion. New York: Macmillan, 1952. Pp. xii
+ 698. $5.00.


Sifferd, Calvin S. College and you. Bloomington, 
Ill.: McKnight & McKnight, 1952. Pp. 111. 
$2.50. 

Smith, M. B. The single woman of today. New 
York: Philosophical Library, 1952. Pp. xiv + 
130. $2.75. 




















HISTORY OF 


AMERICAN PSYCHOLOGY 
by A. A. Roback 


Here is the first history of American Psychology
ever to appear, showing through
development stages how this vastly significant
aspect of human study reached its
present importance. The volume presents an
over-all picture covering three centuries, including
the numerous divisions and activities
of the powerful American Psychological
Association.

Author of more than twenty books on
human behavior (many translated into foreign
languages), and as one who stood close
to the chief architects of the science, Dr.
Roback naturally possesses much first-hand
information. The ever-growing importance of
the subjects to students, researchers, psychologists,
and intelligent laymen renders this an
invaluable tool for study, reference, and genuine
interest. Copiously illustrated, $6.00


Expedite Shipment by prepayment 


LIBRARY PUBLISHERS 


& West 20th Street, New York 18, N. Y. 

















BERTRAND RUSSELL’S 
DICTIONARY OF MIND 
MATTER & MORALS 


This exhaustive work offers more than
1000 definitions and opinions of the 1950
Nobel Prize winner, arranged as a handy
key. Here is Russell’s challenging thought
on politics, ethics, philosophy of science,
epistemology, religion, mathematical philosophy,
and on topics crucial to an understanding
of international affairs today.
Dipped into casually it rewards the browser
with stimulating and acute intelligence insights.
Read intensively it will be found indispensable
to a fuller appreciation of one of
the profoundest minds of our age. $5.00








Expedite Shipment by prepayment 


PHILOSOPHICAL LIBRARY
Publishers


15 East 40th Street, Desk 195
New York 16, N. Y.











New McGRAW-HILL Books


THE MEASUREMENT OF HEARING

By Ira J. Hirsh, Washington University Medical School. McGraw-Hill
Publications in Psychology. Ready in July

A text for clinical otologists, experimental psychologists, acoustics and
communications engineers, and others who do experimental work in hearing,
this text brings together basic, experimental information about acoustics,
electro-acoustic equipment, psychology of hearing, etc., and applies this
information to several facets of the measurement of hearing. Orders basic
information from physics and psychology pertinent to the student’s
clinical problems.


THE PSYCHOLOGY OF THINKING

By W. Edgar Vinacke, University of Hawaii. McGraw-Hill Publications in
Psychology. 369 pages, $5.50

The first modern treatment of the subject, this text is designed to survey
the phenomena of human thinking, to formulate the basic problems in its
study, and to clarify current research on the subject. In addition it
presents a comprehensive theoretical framework for the understanding of
human mental processes and places the subject of human thinking on a
scientific basis by defining methods, problems, working hypotheses, and
tentatively substantiated conclusions.


PERSONALITY MEASUREMENT

By Leonard W. Ferguson, Metropolitan Life Insurance Co., New York.
McGraw-Hill Publications in Psychology. Ready in July

A college text, providing the first comprehensive account of the major
methods by which personality tests have been constructed. The material
covered has heretofore been available only in extremely scattered sources.
This book brings together a description and appraisal of the test
construction methodologies with which all serious students of psychology
must be familiar.


THE PSYCHOLOGY OF LEARNING

By James E. Deese, Johns Hopkins University. McGraw-Hill Publications in
Psychology. Ready in September

A broad treatment of the psychology of learning, written from the point of
view of an experimental psychologist and covering both human and animal
learning. Emphasis is on experimental evidence rather than upon current
theoretical “schools.” Recent experimental literature is stressed, and
post war experimental work on such topics as punishment, retroactive
inhibition, etc. is highlighted.

Send for copies on approval


McGRAW-HILL BOOK COMPANY, Inc.

330 West 42nd Street

New York 36, N. Y.
















